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ABSTRACT 

These notes contain the written version of lectures given at the 2009 Les 
Houches Summer School "Modern perspectives in lattice QCD: Quantum field 
theory and high performance computing." The goal is to provide a pedagogi- 
cal introduction to the subject, and not a comprehensive review. Topics covered 
include a general introduction, the inclusion of scaling violations in chiral pertur- 
bation theory, partial quenching and mixed actions, chiral perturbation theory 
with heavy kaons, and the effects of finite volume, both in the p- and e-regimes. 



I. WHY CHIRAL PERTURBATION THEORY FOR LATTICE QCD? 



It is often claimed that lattice QCD provides a tool for computing hadronic quantities 
numerically, with fully controlled systematic errors, from first principles. This assertion is 
of course based on the fact that lattice QCD provides a nonperturbative definition of QCD 
(in fact, the only one to date). So, why not choose the parameters (in particular, the quark 
masses) at, or very near, their physical values, and compute all quantities of interest? 

There are at least two major obstacles to this. The first obstacle is that lattice QCD 
is formulated in euclidean space, rather than in Minkowski space. This is a necessary 
restriction if one wants to use Monte Carlo methods in order to evaluate expectation values 
of operators from which one extracts physical quantities. In euclidean space correlation 
functions do not give direct access to physical scattering amplitudes; they first need to be 
continued to Minkowski space. If one is interested in physics at some scale for which an 
effective field theory (EFT) is available, one can use this EFT in order to match to euclidean 
lattice correlation functions in the regime of validity of the EFT. In the case of the chiral 
EFT for QCD, the form of the correlation functions is predicted in terms of a finite number 
of coupling constants to a given finite order in a momentum expansion. Once the value of 
these is known from lattice QCD computations, the EFT can then be used to continue to 
Minkowski space. Chiral perturbation theory (ChPT) provides this EFT framework for the 
low-energy physics of the (pseudo-) Nambu-Goldstone bosons of QCD. 

A second obstacle is that in practice lattice QCD computations are carried out at values 
of the up and down quark mass larger than those observed in nature, 1 because of limits on 
the size of the physical volumes that can be handled with presently available computers, 
and because of the rapid increase in algorithmic cost with decreasing quark masses. If the 
quark masses are nevertheless still small enough for ChPT to be applicable, it can be used 
to extrapolate to the physical values for these light quark masses, again using ChPT to 
connect an "unphysical" lattice computation (namely, at values of the light quark masses 
larger than their real- world values) to physical quantities (those observed in nature). 

It should be said that lattice QCD is moving toward the physical point, i.e. , that nu- 
merical computations are being done with the light (up and down) quark masses at or very 
near the physical values. If lattice quark masses are very near the physical values, simple 
smooth extrapolations are in principle enough to obtain hadronic quantities of interest at 
the physical point, and the use of ChPT may become less important in this respect. I will 
return to this point below. 

Both of the obstacles described above are examples of how ChPT (and, in general, EFTs) 
can be used to connect "unphysical" computations with physical quantities. This is partic- 
ularly helpful in the case of lattice QCD, where often computations are easier, or even only 
possible, in some unphysical regime. The use of ChPT is by no means limited to extrapola- 
tions in quark masses, or continuation from euclidean space. Other examples of the use of 
ChPT to bridge the gap between the lattice and the real world are, in increasing order of 
boldness: 

1. Nonzero lattice spacing. While in state-of-the-art lattice computations the lattice 
spacing is small (in units of Aqcd), scaling violations are still significant. It is possible 



The strange quark mass can be taken at its physical value, although it is theoretically interesting to see 
what happens if one varies the strange quark mass as well. 
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to extend ChPT to parametrize scaling violations afflicting the physics of Nambu- 
Goldstone bosons. In particular, depending on what lattice fermions one uses, chiral 
symmetry (and rotational invariance!) are broken on the lattice, and such breakings 
can be parametrized in ChPT through new coupling constants that vanish in the 
continuum limit. This will be the topic of the second lecture, Sec. IIHI Another 
example in this category is the use of a finite volume in numerical computations. The 
dominant effects of this can also be studied in ChPT, as we will see in Sec. I VI I 

2. Partial quenching and the use of mixed actions. As we will see, on the lattice one 
is free to choose the valence quark masses to be different from sea quark masses. 
This is of course completely unphysical (it violates unitarity!), but it turns out to 
be very practical - it is (often much) cheaper to vary only valence quark masses 
than to vary the sea (or "dynamical") quark mass, which affects the ensemble of 
gauge configurations. Moreover, as we will see, varying valence and sea quark masses 
independently from each other gives us extra handles on the theory, making it easier 
to determine the couplings of ChPT than it would be without this possibility. This 
trick is known as "partial quenching." 2 One may even go further: one can use different 
discretizations for sea quarks (typically a computationally less expensive method, such 
as staggered or Wilson quarks) and for valence quarks (a discretization using lattice 
fermions with very good chiral and flavor symmetries — operator mixing depends 
primarily on the symmetries of those operators, which are constructed out of the 
valence quarks). This generalization of partially-quenched QCD (PQQCD) is now 
often referred to as "mixed-action" QCD. We will discuss these methods in the third 
lecture, Sec. HV] 

3. Other unphysical constructions used in lattice QCD; a prominent example is the use of 
"fourth-rooted" staggered fermions. Again, EFT techniques provide much inside into 
the nature of this method, and the comparison of EFT calculations with numerical 
results can be used to test the validity of such methods. 

Even if one is only interested in physical quantities which can directly be computed on 
the lattice, such as (stable) hadron masses at the physical values of the quark mass for a 
very small lattice spacing, ChPT may still be used to test the results. By varying the quark 
mass away from the physical value, one might discover that the numerical results do not 
match the predictions of continuum ChPT, for instance because the lattice spacing is still 
too large, the volume too small, or because of some other systematic effect. In other words, 
ChPT provides a useful tool for the validation of results obtained with lattice QCD. 

Extending this latter argument, it can also be very interesting to use lattice QCD as a 
laboratory to explore what strong interaction physics would look like at parameter values 
different from those in the real world. Again, interpreting the results of numerical com- 
putations in lattice QCD through the help of EFT techniques can be very useful in this 
respect. An example of this is the question about whether it is possible that the up-quark 
mass m u could be zero. 3 This question has now been answered with the help of lattice QCD 



2 "Quenched QCD" corresponds to the case in which the sea quark masses are taken to infinity, making the 
fcrmion determinant constant. This is of course unphysical "beyond repair": all effects of quark loops are 
irrevocably gone. 

3 Which would solve the strong CP problem. 
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computations at nonzero m u (the answer is that m u = is now exluded with a very high 
confidence level). This is only possible with assistance of ChPT, which makes it possible to 
extrapolate to what physics would have looked like at m u = 0, from computations done at 
nonzero values of m u . 

Finally, with limited computational resources, there is the important question as to how 
to use them most judiciously. Does one use all resources to do a few, very expensive, 
computations at physical quark masses, in very large volume? While this sounds very 
attractive, restrictions on available computational resources may force one to choose only 
larger lattice spacings, and one thus pays with the increase of one systematic error for the 
removal of other ones. Or does one try to strike a balance between the various systematic 
errors which afflict lattice computations, by using larger-than-physical quark masses, and 
less expensive fermions with symmetry properties that are not as nice? What choices to 
make depends very much on the physics one is interested in, and they are to some extent 
subjective. However, ChPT and its extension to unphysical situations helps us with making 
such choices, and it therefore is, and will likely remain, a very important tool for lattice 
QCD practitioners. 

The aim of these lectures is not to give a review of all the work that has been done in 
ChPT and its applications to lattice QCD. Rather, the aim is to provide a basic introduction 
to ChPT, with emphasis on its applications to lattice QCD, geared at readers with a general 
knowledge of quantum field theory and the basics of lattice QCD. The list of references 
included is heavily biased toward the papers from which I learned the subject. It will not 
be possible to refer to all papers that have been written on this and closely related topics 
and applications; there are probably at least a few hundred more than the references I did 
include. The references I did include are primarily those in which new concepts or methods 
are introduced and explained, and I strongly recommend them for delving deeper into the 
many aspects and applications of ChPT in its relation to QCD on the lattice. 

It is not even possible to cover all the relevant and important applications that have 
been considered in the literature. Notable examples of topics left out are applications to 
baryons (including EFTs for two-nucleon systems), and to hadrons containing heavy quarks, 
although some aspects of these topics are shared by "heavy-kaon" ChPT, which we consider 
in Sec. |V] The topics I have chosen to include are a general introduction to ChPT in Sec. [TTl 
the incorporation of scaling violations in ChPT in Sec. IIII[ partially quenched and mixed- 
action ChPT in Sec. IIVI two-flavor ChPT with a heavy kaon in Sec. |V] and ChPT in finite 
volume in Sec. I VI I 

Good reviews of continuum chiral perturbation theory can for instance be found in 
Ref. [H, and, at the level of a textbook, Refs. [2], Hj|. For a more general introduction 
to EFT methods, including ChPT, see Ref. Q. For applications of EFT methods to lattice 
QCD, see Ref. |5j; for a more detailed review of ChPT in the context of lattice QCD, see 
Ref. [6|. Another lattice-oriented introduction, with applications of ChPT to lattice calcu- 
lations of weak matrix elements, is Ref. [3]. For a recent review of applications of ChPT to 
hadron phenomenology, see Ref. j8j. 

II. CONTINUUM CHIRAL PERTURBATION THEORY 

We begin with a review of ChPT in the continuum, in infinite volume. This is a vast 
subject in itself, with many important applications to phenomenology. I will not review 
any of those here; the idea is to get an understanding of the basics, which we will need for 
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applications to lattice QCD. 



A. Chiral symmetry 

Let us consider QCD with Nj = 3 flavors, the up, down and strange quarks. (We will 
also have reason to consider the Nf = 2 theory, with only up and down quarks, in situations 
where the strange quark can be considered heavy.) Of course, heavier quarks exist, but we 
will be interested in physics at energy scales well below the charm mass, so we may consider 
the theory in which the charm, bottom and top quarks have been decoupled by integrating 
them out. In deference to the lattice, we will work in euclidean space throughout these 
lectures, continuing back to Minkowski space when necessary. The fermion part of the QCD 
lagrangian is 

C = q L pq L + q R pq R + q L Mq R + q R M^q L , (2.1) 

in which g i; i — u, d, s is the three-flavor quark field. The subscripts L and R denote left- 
and right-handed projectors: 

Ql = ^(1 - 75) q , q R = 3(1 + 75) q , (2.2) 

q L = q\( 1 + i5), q R = q\( l -is)- 

Note that we may define q L R as we like, since the Grassmann fields q and q are independent 
in euclidean space; we have chosen them to be consistent with q — > q^ in the operator 
formalism. M is the quark mass matrix, equal to 



M= m d . (2.3) 




For massless QCD, with M — 0, the lagrangian is invariant under the symmetry group 
U(3) L xU(3) R : 

qi ->■ U L q L , q R -)• U R q R , (2.4) 
1l -> IlUl , -> IrUr , 

with Ul,r G U(3)l,r- We observe that the full QCD lagrangian is invariant if we also let M 
transform as 

M -> U L MU R . (2.5) 

Of course, the quark masses do not transform, but this "spurion" trick, of letting some 
parameters transform under the symmetry group, will be useful later, for instance in 
Sec. Ill El Note that in euclidean space, L (with Eq. (12. 5p ) is invariant under a larger 
group, GL(3,C)l x GL(3,€) R . (If we consider this larger group, Ul and U R in Eqs. (J23J 
and (I2.5p have to be replaced by L7J 1 and U^ 1 , respectively.) But if we identify q with g^7°, 
this reduces this larger group to U{3)l x U(3) r , from which it can be shown that the larger 
symmetry group in euclidean space has no additional physical consequences. 

It is well known, of course, that "axial U(l), v i.e. , transformations with Ul = U R = 
exp(i6)l, are not preserved when QCD is quantized, even when M — 0. The actual sym- 
metry group is therefore SU(3)l x SU(3) r x £7(1), where the U{1) factor is quark number. 
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On mesons, the topic of these lectures, quark-number U(l) is trivially realized, and we may 
drop it from consideration. 

In the real world, we find that there exists an octet of pseudoscalar mesons with masses 
smaller than any other hadron masses. Moreover, one observes that relations between their 
masses and interactions are reasonably well described by assuming an approximate SU(3) 
flavor symmetry. 4 Under this SU(3), the pseudoscalar mesons form an octet 



{ K- K° -fj 




(2.6) 



where we also indicated the (valence) quark content. 5 (In the theory with only two light 
flavors, reduces to the two- by-two upper left-hand block, with the r\ omitted.) Under 
SU (3) this octet transforms as 

<p U<pU ] , (2.7) 

and it is clear that this SU{3) can be identified with the diagonal subgroup SU(3)v of 
SU{3) L x SU{3) R , for which U L = U R = U in Eq. fl2^j) . 

If the full symmetry group SU(3)l X SU(3)r were realized "manifestly" in nature, we 
would observe larger hadronic multiplets, and in particular, we would observe "parity part- 
ners," i.e. , pairs of hadrons with the same mass but opposite parity, since parity takes 
left-handed quarks into right-handed ones and vice versa. 6 Such parity partners are in gen- 
eral not observed, and specifically, there are no scalar mesons with masses near those of 
the pseudoscalar multiplet of Eq. (12. 6p . Instead, it is universally believed (with very strong 
evidence from both the real world and lattice computations) that, in a (hypothetical!) world 
with massless quarks, the chiral group is spontaneously broken to the diagonal subgroup, 
SU(3)l x SU(3)r — > SU(3)v- This requires eight Nambu-Goldstone bosons (NGBs), which 
are identified with the pions, eta and kaons of Eq. (I2.6p . Since the broken generators distin- 
guish between left- and right-handed quarks, these NGBs have to be pseudoscalars. In the 
real world, quarks are not massless, but if the quark masses m u ^, s are small compared to 
Aqcd, they can be treated as a perturbation (as we will see below), giving the NGBs a small 
mass. Indeed, the pions have a very small mass of about 140 MeV, while the kaons and eta 
have a mass of about 500 MeV, which is at least smallish compared to other hadron masses. 
For a proof that SU(3) y does not undergo spontaneous symmetry breaking in continuum 
QCD, see Ref. Q. 

Our interest in these lectures will be in the physics of these NGBs, at energies below those 
at which any other hadrons can be produced. We therefore expect that it should be possible 
to find an EFT for the physics of NGBs. What that means is that it should be possible to 
write down a local lagrangian in terms of the field <fi that, in some systematic approximation, 
reproduces correlation functions involving only NGBs, with restrictions imposed by the 
principles of quantum field theory. The chiral lagrangian is precisely this EFT. 



4 We will return to the issue that the strange quark mass is much larger than the up and down quark 
masses, see Sec. [V] 

5 We ignore isospin breaking, which causes the fields 7r° and rj of Eq. (|2.6p to mix, in most of these lectures. 

6 In mathematical terms, parity is an automorphism of the group SU(3)l x SU(3)r. 
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B. The chiral lagrangian 



Let us therefore start with a closer look at spontaneous symmetry breaking of chiral 
symmetry, for the case that M — 0. An order parameter for this breaking is the renormalized 
condensate 

(VrnVLj) = (q u <lRj) « A QCD$ij > ( 2 - 8 ) 
where the power of Aq C d estimates the magnitude of the condensate. As is always the case 
with spontaneous symmetry breaking, there is a manifold of equivalent vacua, and indeed, 
one may rotate this condensate as 

% = (quq Rj ) -> U L QU R , (2.9) 

where I indicated how Q transforms under the chiral group (here I sum over spin and 
color indices, so that Q is a color and spin singlet). These vacua are all equivalent, and 
they are rotated into each other by elements of the coset SU(3)l x SU(3)r/SU(3)v, which 
happens to be isomorphic to the group SU(3). The standard choice of Eq. (I2.8P leaves 
the diagonal subgroup SU(3)v invariant, but this does not mean that with another choice 
of condensate there would be no SU(3) invariance: the unbroken SU(3) would simply be 
differently embedded in SU(3) L x SU(3) R . 7 

The color-singlet operators ti(TqLiq R j) and their parity conjugates, when acting on the 
vacuum, create mesons with flavor quantum numbers corresponding to the flavor indices i 
and j, and spin-parity corresponding to the matrix T, which is some product of Dirac gamma 
matrices. Since we are interested here in scalar and pseudoscalar mesons, we choose T = 1. 
Introducing a complex scalar field Hij with the same quantum numbers, we can write down 
an effective lagrangian of the form 

C eff = trid^O^H) + C int (H, fft) + C other (H, H\ other hadrons) . (2.10) 

This lagrangian should obey the same symmmetries as the QCD lagrangian, and in par- 
ticular, it should be invariant under SU(3)l x SU(3) r , with H — > UlHU r . We can now 
decompose H = i?E with R hermitian and positive, and E unitary. Symmetry breaking with 
the pattern SU (3) l x SU (3)r — > SU(3)v implies that R picks up an expectation value, which 
again we can take to be proportional to the unit matrix, as in Eq. (I2.8p . X parametrizes 
the vacuum manifold, and thus describes the NGBs predicted by chiral symmetry breaking. 
If all other hadrons are massive, and we are interested only in the physics of NGBs below 
the typical hadronic scale, we can integrate out all non-Goldstone fields, leaving us with a 
low-energy effective lagrangian in terms of only S. This lagrangian is local at scales below 
the hadronic scale. Because of the anomalous axial U(l), the meson associated with this 
symmetry is also heavy, and we may take S in 5*^(3) rather than U(3), and parametrize it 
as 

E = exp (2i<f>/f) -)• U L m R , (2.11) 

with the field of Eq. ( 12. 6p . The field E inherits its transformation under the chiral group 
from the chiral transformation of H, as indicated above. We inserted a dimensionful pa- 
rameter / oc Aqcd so that the field has the canonical mass dimension of a scalar field. 



7 If we choose some other value f2 for the condensate, the subgroup leaving this invariant is the subgroup for 
which Ul = ^lUnfl -1 , which is isomorphic to SU(3)v- Note that for a different value of ft the definition 
of parity would also need to be modified. 
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Observe that E precisely parametrizes the vacuum manifold as in Eq. (I2.9P for the choice 
Qij = —(qq)5ij/(2Nf). The unbroken group SU(3)v is linearly realized: Eq. (12. lip implies 
indeed that <fi transforms as in Eq. (12. 7p under SU(3)y. This is not true for the full chiral 
group: the field transforms nonlinearly under all symmetries outside the subgroup S77(3)y. 
For more discussion, see Sec. [V] 
Our lagrangian thus simplifies to 

C eff = -^tr^E^E) + £ int (E, E*) (2.12) 

where I have normalized the first term such that, upon expanding E in terms of 0, the meson 
fields of Eq. (12.61) have properly normalized kinetic terms. While this is progress, we clearly 
need more input to turn this into any practical use. 

The EFT does not have to be renormalizable. In fact, one expects the presence of a 
cutoff of order 1 GeV, because we integrated out all other hadrons, with masses of order 
1 GeV and up. But we do expect that pion 8 interactions obey all fundamental properties 
of quantum field theory: unitarity and causality, crossing symmetry, clustering and Lorentz 
invariance [13], of course all to the precision with which the EFT reproduces the physics 
predicted by QCD. We will assume 9 that all these properties are satisfied if we take C e jf 
to be the most general local, Lorentz (or euclidean) invariant function of the field E and its 
derivatives [10]. 10 Moreover, since the EFT should obey all symmetries of the underlying 
theory, it should be invariant under SU(2>)l x SU(3)r. 

Our task is therefore to construct local invariants out of the field E. Because of the 
transformation rule for E, cf. Eq. (12. lip , such invariants can only be formed by alternating 
E with E^, and taking a trace of such a product. One may also multiply such traces together 
to form new invariants. However, all such invariants collapse to a constant, because EE^ = 1! 
The only way out is to allow for derivatives on the fields, as in the first term of Eq. (I2.12p . 
Because of Lorentz invariance, we need at least two such derivatives, and we can organize 
terms in our lagrangian into groups of terms with the same number of derivatives. The 
unique term with just two derivatives is the one shown as the first term in Eq. (I2.12p . 

This construction leads to several fundamental consequences. First, since we need at 
least two derivatives in all terms, pion interactions vanish when their momenta vanish, and 
thus they become weak for small momenta (and masses, as we will see below). This, in turn, 
implies that we may organize our EFT in terms of a derivative expansion: pion correlation 
functions with small momenta on the external legs can be expanded in terms of a small 
parameter p/A x , where p is a typical pion momentum, and A x is the typical hadronic scale, 
of order 1 GeV. 11 

These observations give us a very powerful method for constructing the chiral lagrangian. 
If our aim is to work only to leading order in the derivative expansion, the chiral lagrangian 
is simply given by the first term in Eq. (12. 12ft : 

= ~/ 2 tr(^E%E) . (2.13) 



8 I will often use "pion" to refer to all (pseudo-) Nambu-Goldstone bosons, kaons and eta included. 
I am not aware of any general proof from the underlying theory, QCD. 



10 A more constructive argument, in which clustering plays a central role, was given in Ref. 

11 This assumes that there are no other "accidentally" massless (or very light) hadrons other than the pions. 
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Note that this is an interacting theory, because £ is nonlinear in the pion fields 0. It 
therefore describes all pion physics to leading order in pion momenta. For example, writing 
4> as = 4> a T a , with T a the generators of SU(3) obeying 

ti(T a T b ) = 5 ab , [T a , T h \ = V2tf abc T c , (2.14) 

one finds the leading-order prediction for the pion scattering amplitude from a tree-level 
calculation using Eq. (12.1 3ft : 

A(ab ->■ cd) = — !— {(t - u)f abe f cde + (s- u)f ace f bde + (s - t)f ade f bce } , (2.15) 

in which s, t and u are the Mandelstam variables. This is our first prediction from ChPT. 

Beyond leading order, we need to worry about two different (but, as we will see, intricately 
related) issues. We need to construct the most general term of order p 4 , i.e. , with four 
derivatives, and we also need to worry about unitarity (up to a given order). A nice way 
of building terms with higher derivatives starts from the observation that we can define an 
object with one derivative that transforms only under SU(3)l, or, analogously, an object 
which transforms only under SU(3)r: 

Lfj, = £<9 M £t = -«9 M £Et , (2.16) 
R„ = T^dfjT, = -<9 M Et£ = . 

The most general lagrangian with four derivatives is 13 

£ (4) = -Lx (tr(L M L M )) 2 - L 2 tr(L M L,) tr(L^) - L 3 ti{L^L u L v ) , (2.17) 

in which £1,2,3 are new (dimensionless) coupling constants. For general Nf, there is an 
additional term of the form tr(L /J L J/ L /i L j A but for Nf = 3 this can be written in terms of 



the three terms appearing in Eq. (I2.17P 12]. Even with only three light quarks, this is not 
true in the partially-quenched case, see Sec. HVl 

We see that the new terms in will contribute to pion scattering at tree level: each 
of these terms starts off with a {d<p/ f) 4 term when we expand S in terms of (p. Thus, 
indeed, such contributions will be of order (p//) 4 , i.e. , of order (p/ f) 2 relative to Eq. (12. 15[) 
- apparently / plays the role of the typical hadronic scale ~ 1 GeV, and (p/ f) 2 is the 
expansion parameter of ChPT - see below. 

In order to preserve unitarity, we should also calculate loop contributions, insofar as 
they contribute to order p A . They arise when we consider the one- loop scattering di- 
agrams with two four-pion vertices from £( 2 \ This leads to contributions of the form 
(s/47r/ 2 ) 2 log (— s/A 2 ) (and more such terms also involving t and u, as dictated by crossing 
symmetry), where A is the cutoff needed in order to define the theory, and the factor l/(47r) 2 
comes from the loop integral. The appearance of the logarithm is interesting for two reasons: 
it shows that, as mandated by unitarity, there is a two-particle cut in the pion scattering 
amplitude. Second, it is clear that a cutoff is needed in order to define theory. 

Physical quantities, such as the scattering amplitude, cannot depend on this cutoff, which 
has only been introduced in order to define the effective theory. But indeed, since also 
polynomial terms from appear at this order (order p 4 ), the cutoff dependence can be 
removed by taking £1,2,3 to be dependent on A, such that the scattering amplitudes (and 
other physical quantities) are not. We see that in order to renormalize the theory defined 
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by £( 2 \ we need to introduce C^. The new constants £1,2,3 represent the effect of the 
underlying physics that has been integrated out in the EFT. This is the beginning of a 
general pattern, as we will argue in Sec. Ill CI below. 

Before we discuss higher orders, let us consider order p 4 in some more detail. First, note 
that the one-loop contribution is in fact of order p 2 / (Airf) 2 relative to Eq. ( I2.15p . Any EFT 
involves an expansion in a ratio of scales, and in ChPT the higher scale in this ratio appears 
in the form Anf. The size of / can be estimated from pion scattering (using Eq. (I2.15P ); for 
another method, see Sec. Ill Dl At the level of ChPT, the "low-energy" constants (LECs) 
/ and Lj are free parameters, which can only be determined either by comparison with 
experiment, or by matching ChPT to a lattice QCD computation. We can say something 
about the "generically expected" values of the Lj. Since the Lj absorb the scale dependence, 
we expect that their magnitude changes by an amount of order 1/(Att) 2 log(A/A') when we 
change the scale A — > A'. Since the cutoff A appears because we integrated out all heavier 
hadrons, a physically sensible choice for A is the typical hadronic scale of 1 GeV. With this 
interpretation, varying the cutoff within an order of magnitude is reasonable. This shifts 
the Li by an amount of order l/(4n) 2 , and gives us a sense of what one expects the values 
of these LECs to be. 



C. Power counting 

Now, let us consider the derivative expansion more systematically, and show that indeed 
the chiral theory is a proper EFT. What this means is that there exists a systematic power 
counting, and that to any order in the expansion we need only a finite number of coupling 
constants to define the theory. 

Consider an amputated connected diagram with Vd vertices with d derivatives. Collec- 
tively denoting the external momenta by p, this diagram is of order J2d dVd — 21 + 4L in p, 
where / is the number of internal lines, and L is the number of loops. This follows from 
simply counting powers of p. Using, as usual, that the number of loops L = I — J2d Vd + I, 
we can rewrite this as J2d(d ~ 2)Vd + 2L + 2. Our diagram is thus of order 




/V(^) It) • CM*) 

with 

N = Y,k d - 2 ) V d + L , (2-19) 
d 1 

and E the number of external legs. The powers of / in this result follow from dimensional 
analysis. Ignoring the cutoff for now, this is the only other scale in the problem, if we express 
all other LECs as products of dimensionless constants times the appropriate power of /. 

We see that all contributions to a certain amplitude of a fixed order in external momenta 
correspond to a fixed value of N. For instance, for AT = 0, only tree-level diagrams, calculated 
from £( 2 ) contribute, because AT = requires that L = and Vd = for d > 2. For N = 1, 
one-loop diagrams coming from combine with tree- level diagrams coming from C^ 4 \ 
consistent with our discussion in the previous section. In general, if we calculate to some 
fixed order in N, we need the chiral lagrangian only up to £( 2Ar + 2 ). In other words, while 
our EFT is not renormalizable, it is nevertheless predictive if we work to a fixed order in the 



10 



derivative expansion, since to that order only a finite number of LECs occur in the chiral 
lagrangian [lO]. 12 

Each loop provides a factor l/(47r) 2 . If we assume (cf. Sec. Ill Bj) that this number also 



sets the natural size of all dimensionless LECs, that turns the momentum expansion into 
an expansion in powers of p 2 / '(4tt f) 2 . Then, so far we have ignored the fact that loops lead 
to divergences. That means that there is another scale, the cutoff, that can appear in our 
result. Putting everything together, we therefore amend our power-counting result: our 
diagram takes the schematic form (restoring a delta function for momentum conservation) 

(2.20) 

with F a dimensionless function. Contributions to F can be of three types. First, any 
positive powers of p 2 /A 2 can be ignored, as they correspond to higher-order contributions. 
Then, depending on the regulator, negative powers may occur (i.e. , positive powers of the 




cutoff); those correspond to power divergences. Since all divergences have to be local [13 
such divergences can be absorbed into lower-order LECs. Finally, there can be logarithmic 
divergences, which, as we have seen, can be renormalized by LECs in £( 2Ar + 2 ) if they occur 
at iV loops. In dimensional regularization, power divergences do not occur, making this 
regulator the most practical one for calculations in ChPT. 13 

The conclusion of this section is that a well-defined power-counting scheme exists, turning 
our EFT, i.e. , ChPT, into a systematic and practically useful tool. This observation will 
play an important role when we start using ChPT in applications to lattice QCD. 



D. Conserved currents 

It follows from Noether's theorem that there are sixteen conserved currents, associated 
with the generators of the group SU(3)l x SU(3) r . At the level of QCD, these currents 
are fixed by the lagrangian and the symmetry group, and the same should thus be true at 
the level of the chiral lagrangian. In both cases, these currents can be calculated following 
Noether's procedure. Equivalently, one may couple the QCD lagrangian to external gauge 
fields, £ M and r M (taken to be hermitian), which transform under local chiral transformations 
as 

l^Ud^l-id^jJjl , (2.21) 
?V -»■ U R rJJ ] R - id^U R U R , 

with the covariant derivatives in Eq. (12. ip turning into — > — ii^ when acting on left- 
handed quarks, and — > — ir^ when acting on right-handed quarks. This substitution 
makes the QCD lagrangian (we still are considering the case that M = 0!) invariant under 
local chiral transformations, and thus the chiral lagrangian should also be made invariant 
under local chiral transformations (in addition to Lorentz (or euclidean) invariance and 
parity). This can be accomplished by replacing 

-> = 8^ - i^E + zEr„ -> U L D^U R (2.22) 



12 The number of LECs grows rapidly with N, rendering our EFT practically useless beyond N 

13 For a different chirally invariant regularization using a lattice cutoff, see Ref . 14 1 . 
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everywhere in the chiral lagrangian, given to order p 4 by the sum of Eqs. (12. 13[) and (12.17p . 
The second arrow in Eq. (I2.22p indicates how the covariant derivative of £ transforms; 
Ul,r are now local transformations in SU(3)l,r- In addition, more invariant terms can be 
constructed, if we use also the building blocks 

V = d^-dJ^-i^Q , (2.23) 

At order p 2 there are no new terms (because these field strengths have two Lorentz indices) , 
but at order p A several new terms arise: 

tL 9 tr (V£> M £(A,£)t + R^D^DvZ) - L l0 tT(L^ER^) (2.24) 
—HitryL^uL^u + R^R^u) ■ 

With the chiral lagrangian as constructed above, we have that 

log ZchPT^fM, r>) = log Zq C d(^, r^) + constant , (2.25) 

up to the order at which we work in the chiral expansion. Since and r M couple to the 
left- and right-handed Noether currents, (connected) correlation functions of these currents 
are generated by taking derivatives with respect to £ M and r M . In words, Eq. (12.251) states 
that these correlation functions, as calculated in ChPT, equal those of QCD, to some given 
order in the derivative expansion. 

By taking one derivative of the chiral lagrangian with respect to or r M , and setting 
these sources equal to zero, we find the conserved left- and right-handed currents. Showing 
only the lowest-order explicitly: 

i/c^ + ..., (2.26) 
\fd^ + ... . 

Therefore, to lowest order in the derivative expansion, and writing 

= a T a , (2.27) 

T L,R _ jL,Rrp 
,J fi ~ ,J aii 1 a ) 

with T a the SU(3) generators normalized, as before, through tr(T a Xfc) = 5 a b, we get for the 
pion to vacuum matrix element of the axial current 

(0\J*{z) - J^(x)\Mp)) = -iPr/Sabe-** , (2.28) 

and we conclude that / is equal to the pion decay constant, in the chiral limit. 14 We 
also find another prediction: to leading order in the chiral expansion f n = f K = f v . 
We end this section with a number of remarks: 



Our normalization is such that in the real world f v = 130.4 MeV. Another common convention uses a 
value smaller by a factor \pl. 



Ji = \fd^ + ... = - 

j; = - l -j 2 ^d^ +...= 
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The observation that we can obtain conserved currents in the effective theory just as 
well from the Noether procedure or from the "source method" that I described above 
is correct, but both methods do not in general lead to the same current. At order 
p 2 the currents obtained using either of these methods are the same, but at order p 4 
they differ by a term proportional to L 9 . Clearly, if we do not introduce the sources 
and r M at all, the L 9 term in Eq. (I2.24p never appears, and therefore the Noether 
current has no term proportional to L 9 . But if we use the source method described 
above Eq. (12.261) and apply it to the terms in Eq. (12.241) . we find additional terms 

AJ^ = iUd v \L^L v \ , (2.29) 
AJ* = %L % d v [R^R v \ . 

However, these extra terms are automatically conserved, d^AJ^ 11 = identically. 
Therefore, the most general form of the conserved current is that provided by the 
source method. 

When we gauge the group SU(3)l x SU(3)r, as we did above, it is in fact anomalous. 
In order to reproduce the "nonabelian" anomaly in the chiral theory, we need to add 
the gauged Wess-Zumino-Witten term [l5j]. Since this part of the chiral lagrangian 
will play no role in the rest of these lectures, we will not discuss this any further. 

While (apart from the anomaly), the derivation given above looks very straightforward, 
in fact the freedom to perform field redefinitions and to add total-derivative terms 
is needed to complete the proof that the correct recipe is to just choose the chiral 
lagrangian to be locally invariant 

A new term proportional to Hi, not containing the pion fields E, shows up. This term 
can only contribute contact terms to correlation functions and is thus not physical. 
For instance, if one considers the two-point function (J^(x)J^(y)) at one loop, this 
has a logarithmic divergence, which can be absorbed into Hi. Such parameters in the 
chiral lagrangian are sometimes referred to as "high-energy constants." 

The coupling of the chiral lagrangian to the sources and tells us how to couple 
pions to the photon and to (virtual) W and Z mesons. For instance, one obtains the 
photon coupling by taking 




rv = --ecv 0-1 , (2.30) 



with a M the photon field and e the charge of the electron. 

Generally, electromagnetic effects in hadronic quantities are of approximately the same size 
as the isospin breaking coming from the fact that in nature m u ^ ma- This implies that 
ultimately electromagnetic effects will have to be taken into account in lattice QCD compu- 



tations. For explorations in this direction, see Ref. [16 . 
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E. Quark masses 



It is time to remember that in the real world the masses of the up, down and strange 
quark do not vanish. Generalizing Eq. (I2.5p . the mass terms in Eq. (12. ip can be replaced by 
source terms, with hermitian scalar and pseudoscalar sources s(x) and p(x): 

q L (s + ip)q R + q R (s - ip)q L , (2.31) 

with SU(3)l x SU(3) R transformation rules 

s + ip ^ U L (s + ip)U R , s - ip U R (s - ip)U{ . (2.32) 

We recover the quark mass terms by setting s = M with M as in Eq. (12. 3p and p = 0. 

This gives us a new building block for constructing terms in the chiral lagrangian. To 
lowest order in s and p, using parity symmetry, there is a unique operator that can be added 
to C^ 2 \ which now becomes 

= \f^ ((D^D.Z) - \f tr ( X t£ + S t x ) , (2.33) 
X = 2B (s + ip) , 

with B a new LEC. Setting x = X^ = 2B M, and expanding £ to quadratic order in 0, we 
can read off the (tree-level) pion masses in terms of the quark masses: 

ml+ = B (m u + m d ) , (2.34) 

m 2 K+ = B (m u + wig) , 

m 2 K o = B (m d + m s ) , 

ttv = A) I m u + m d + I — I I , 

m 2 = ^B (^rn u + m d + 4m s + O 

This gives our third prediction from chiral symmetry: five meson masses are expressed in 
terms of three parameters. If we ignore isospin breaking (i.e. , set rug = m u = m d ), we find 
that m^o = 771^+ and m 2 = (2(m 2 K+ + m 2 K o) —m 2 r+ )/3, relations which agree with experiment 
at the few percent level. The latter relation is the well-known Gell-Mann-Okubo relation. 

Since the masses on the left-hand side are physical quantities, also the expressions on the 
right-hand side should be physical; in particular, they should not depend on the renormal- 
ization scale of the underlying theory. Indeed, by taking a derivative with respect to s and 
then setting all sources equal to zero, we find that in the chiral limit 

(uu) = (dd) = (ss) = -f 2 B , (2.35) 

and we then use that m u (uu), etc., are scale independent. The quark masses appear in the 
chiral lagrangian only in the scale-independent combinations 

Xu = 2B m u , Xd = 2B m d , Xs = 250ms . (2.36) 

From this tree-level exercise, we see that for onshell momenta, one power of the quark 
mass should be counted as order p 2 , because for an onshell pion we have that p 2 = —m 2 +, 



m,, 



m d 
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etc. With this power counting for the quark masses, the whole discussion of Sec. Ill CI applies, 
with the proviso that p 2 in formulas like Eq. f |2.20j) can now also stand for any of the squared 
meson masses. With / = 130 MeV, we note that the expansion parameters of the chiral 
expansion are 



ml m K 



which gives one hope that even for kaons the chiral expansion may be well behaved. We will 
return to this issue in Sec. |VJ As an aside, we also see why the heavy quark masses (charm 
etc.) cannot be accounted for in ChPT: The corresponding "expansion" parameters would 
not be small. In fact, the situation is "upside down": one may instead consider expansions 
in AQCD/mheavy, leading to Heavy- Quark EFT. 15 

In order to find the meson masses, we expanded the nonlinear £ around 1. Indeed, when 
all masses in Eq. (12. 3p are positive, £ = 1 is the minimum of the potential, which is given 
by the second term in Eq. (I2.33p . When we allow two of the masses to be negative, the 
situation is essentially the same. For instance, if m u ^ < while m s > 0, the vacuum is 

£„ ac = diag(-l,-l,l) , (2.38) 

which is equivalent to the trivial vacuum by an SU(3) rotation. When, however, an odd 
number of quark masses are negative, there is no SU (3) rotation relating the vacuum to the 
trivial vacuum, and the theory can be in a different phase, in which CP is broken because of 
the appearance of a #-term with 9 = 7r (17|. Since in nature all quark masses are positive (or 
equivalently, 9 = 0), we will only consider the case of positive quark masses throughout this 
talk. 16 However, we will see in Sec. IIII CI that a nontrivial vacuum structure is nevertheless 
possible for some discretizations of QCD at nonzero lattice spacing. 

Of course, with the scalar source x as a new building block, more terms can appear in 
£( 4 ) as well. Gauging Eq. (12.1 7p and including Eq. (I2.24p . the most general form becomes 17 

£ (4) = -Li (tr (J?^(fl^)t)) 2 - L 2 tr (p^DX?) tr (z^E^E)*) (2.39) 
-L 3 tr [DX D ^) ]D ^{Du^) ] ) 

+L 4 tr ( J D M E(D M E)t) tr( X t S + tf X ) + L B ti (l^E^E) %t£ + £ f x)) 
-L 6 (tr( X + S + Et x )) 2 - L 7 (tr( X + S - ^ X )f ~ L 8 tr( X ^X^ + ^X^x) 
+tL 9 ti (L^D^ipjTtf + R^{DX ] D U Y) - L 10 tr(L^Ei?^Et) 
-H^L^L^ + R^R^) - H 2 tr(x t x) • 

The full lagrangian + of Eqs. (12.331) and (I2.39P is invariant under the local group 
SU(3)l x SU(3)r. We note that five new LECs and one new "high-energy" constant have 
been added. The term proportional to H 2 contributes to the mass dependence of the chiral 
condensate, and is needed for instance in order to match lattice QCD, where the condensate 
includes a term proportional to m/a 2 (with m the relevant quark mass). 



15 For an introduction to heavy quarks, see the lectures by Rainer Sommer at this 

16 As mentioned already in the first section, the scenario with m u = is ruled out 

17 Some terms in Eq. (|2.39|) have been removed by field redefinitions. 
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FIG. 1: The functions 1.125 + L(ml) (red curve) and 1 + m^/(47r/) 2 (blue line), as a function of 
ml in GeV 2 . I took A = m p = 770 MeV and / = 130 MeV. 



As an application, we quote some results of the quark-mass dependence of various physical 
quantities to order p 4 |l2j, in the isospin limit m u = m^. 



U = f{l + j^fht + ^(2m< + m s ) - 2L(ml) - L(m 2 K ) \ , (2.40) 



-r- — —r- { 1 + — 9 „ 9 3m„ log — f + ml log 



ml 1 8(2£ 8 -L 5 ) . 16(2L 6 - LA tn A A , r/ 1 r/ 2 , 
2^ = 1+ /2 2m, + ^-^ H2m f + ffls )+LK)--I(m) , 



in which 



= 2B m u = 2B m d , m s = 2B m s , ( 2 -41) 

,2 /™2\ 



L(m = Ji^ff 8 ( A>, 



and where we can use the tree-level expressions ( 12.341) inside the logarithms. Because all 
meson interactions that follow from the chiral lagrangian are even in the meson fields, 18 and 



18 This is not true for the Wess-Zumino-Witten term, which however at this order does not contribute to 
the quantities of Eq. (|2.40p to order p 4 . 
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FIG. 2: Comparison of the PACS-CS (red) and the CP-PACS/JLQCD (black) results for ml/m^ 1 
as a function of the light quark mass rn^ 1 (the so-called "axial Ward identity" definition of the 
quark mass). The vertical line denotes the physical point. From Ref. 



the calculation of both masses and decay constants involves two-point functions, the only 
diagrams that appear at one loop are tadpole diagrams. 

Again, ChPT makes a prediction: if we use fx/ fir — 1-2 as input, one finds that (to this 
order in ChPT) f n / f w = 1.3. With these values for f K j ' f w and f v / f n we see that ChPT 
to order p A works reasonably well for quantities involving the strange quark, with order-p 4 
corrections of about 20-30%. 19 For a discussion of meson masses and decay constants to 
order p 6 in ChPT, see Ref. jll[. For a recent review of the phenomenological status of ChPT, 
and many references, see Ref. 

An important observation is that all LECs, /, Bq and the Li's, are independent of the 
quark mass; they only depend on the number of flavors, A/. 20 All quark-mass dependence 
in Eq. (I2.40p is explicit. It follows from this that the values of the LECs can, in principle, 
be obtained from lattice QCD with unphysical values of the quark masses (as long as they 
are small enough for ChPT to be valid — an important restriction). Thus, if one finds a 
set of quantities through which all LECs can be determined from the lattice, one can then 
use these values to calculate other quantities that are less easily accessible on the lattice. A 
simple example of this is that one can extrapolate physical quantities to the physical values 
of the light quark masses, from lattice computations at larger values of mi in the isospin 
limit. 21 Unfortunately, it is not possible to extract all the Li from two-point functions, since 
the constants £1,2,3 do not show up in masses and decay constants. 
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For a recent discussion of the experimental value of f v , including mixing because of SU(3) breaking, see 



Ref. 



20|. 



20 For the relation between N f = 3 LECs and N f = 2 LECs, see Ref. Q. 

21 Since also EM interactions break isospin, they also will have to be taken into account if one reaches a 



precision at which isospin breaking becomes significant. 
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We conclude this section with an important lesson about the use of ChPT for fitting 
the quark mass dependence of a hadronic quantity on the lattice. As we see in Eq. (I2.40p . 
nonanalytic terms show up, here in the form of so-called "chiral logarithms." Fig. [1] shows 
how important it can be to include such nonanalytic terms in chiral fits, by comparing the 
functions 1.125 + Limfy with 1 + m^/(47r/) 2 . The logarithm has a dramatic effect: it looks 
nothing like the linear curve in the region around the physical pion mass, m 2 = 0.02 GeV 2 . 
If one would only have lattice data points in the region m 2 > 0.2 GeV 2 with error bars that 
would more or less overlap with both curves, and one would perform a linear fit, 22 we see 
that this might lead to errors of order 10% in the value at the physical pion mass. In order 
to confirm the existence of the chiral logarithm in the data, clearly data points in the region 
of the curvature of the logarithm are needed. In the cartoon example of Fig. [1] this means 
that we need data points in the region down to m v ~ 200 MeV. In addition, data points 
with very good statistics help. 

For a realistic example, see Fig. [2} which shows m^/rru as a function of the light quark 
mass, rri£, for fixed (physical) strange mass, from Ref. 22| (to which I refer for details). If 
one would only consider the large points (the black points), one would not be able to 
reliably fit the chiral logarithm in Eq. (I2.40p . and thus also not the linear combination of 
LECs that accompany the logarithm. Only with the red points, which were obtained for 
small values of m n , or equivalently m?, can one hope to perform sensible chiral fits. For a 
discussion of such fits, see Ref. [22|. Here we only note that both 0(a 2 ) effects and finite 
volume can in principle modify the chiral logarithms. These issues can be systematically 
studied with the help of ChPT, as we will see in Sees. II I II and IVT1 



III. CHPT AT NONZERO LATTICE SPACING 

If we compute hadronic quantities on the lattice, the values we obtain will differ from 
their continuum values by scaling violations — terms of order a n with some positive integer 
power n (possibly modified by logarithms), where a is the lattice spacing. For example, for 
the mass of some hadron evaluated on the lattice at some nonzero a we expect that 

^-lattice = M con ti nuurn {jYl quark) "I" C l a ^-QCD 0{cL , ClTTl quar k) , (3-1) 

with Ci at most logarithmically dependent on a. 23 In Eq. (13. ip I indicated schematically 
that the hadron mass also depends on the quark masses in the theory. 

Evidently, at nonzero lattice spacing, there is a "new" infrared scale ah 2 q CD that emerges 
from combining the physical infrared scale Aqcd with the ultraviolet scale a. Physical 
quantities thus depend not only on the quark masses, but also on this new scale, and one 
expects that a combined expansion in terms of m quar ^ and aK 2 q CD should be possible when 
these parameters are both small enough. The goal of this section is to see how ChPT can 
be extended to provide this combined expansion. 

Before we begin doing that, there are two important observations. First, ChPT is con- 
structed by making use of the chiral symmetry of the underlying theory — it is nothing more 



Both functions can be thought of as two-parameter fits. In the case of the logarithm, the two parameters 
are the constant and A, in the case of the linear function, they are the constant and the slope. 
For lattice regulators with an exact chiral symmetry, one expects scaling violations of order a 2 instead of 
order a. See for example Sec. MI Fl 
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or less than a very efficient way of implementing the Ward identities for chiral symmetry. 
This implies that if we use a chirally invariant regulator on the lattice, such as domain-wall 
fermions 24 or overlap fermions, there will be almost no change in the form of the chiral 
lagrangian. Almost no change, because for any lattice regulator continuous rotational in- 
variance is broken, so one expects that new operators will appear in the chiral lagrangian 
that break continuous rotational invariance, but respect the lattice subgroup (usually the 
group of hypercubic rotations). Quite generally, in the construction of the chiral lagrangian 
for Nambu-Goldstone bosons, the only quantity transforming as a four-vector is the deriva- 
tive, so such terms will have to have at least four derivatives. Combined with the fact that 
they need to carry a positive power of the lattice spacing, they are usually of quite high 
order in the chiral expansion, as we will see. The only other difference is that the continuum 
LECs themselves are afflicted by scaling violations, which can in principle be removed by 
an extrapolation to a = 0. 

It is only for regulators that break some of the continuum chiral symmetries that the form 
of the chiral lagrangian will be different. The two most important examples are Wilson-like 
fermions (including also clover fermions and twisted-mass QCD), for which vectorlike flavor 
symmetries are preserved but chiral symmetries are broken, and staggered fermions, for 
which flavor symmetries are broken, and only one chiral symmetry is preserved. Here I will 
primarily discuss Wilson fermions, since they are conceptually the simplest. Section fill Fl is 
devoted to a brief overview of ChPT for QCD with staggered quarks. 

A second point is that we will be dealing with an expansion in two small parameters, 
m quark and aAg CD , and we thus have to define a power counting — we have to compare their 
relative size in order to know to what relative order in each parameter we have to expand. 
Typical quark masses in lattice computations range between 10 and 100 MeV, and typical 
lattice spacings vary between aT 1 ~ 2 — 3 GeV. With Aqcd ~ 300 MeV, this means that 
m q uark and ciAq CD are of the same order, and this is what I will assume to get the discussion 
started, until Sec. IIII CI 

QCD with Wilson fermions has no chiral symmetry, and it thus seems that our guiding 
principle for constructing the chiral effective theory is completely lost: we should allow for 
all kinds of operators in the chiral lagrangian that are only invariant under SU(3)y, and the 
only thing we know is that any such operator that is forbidden in the continuum theory is 
multiplied by some positive power of the lattice spacing. It thus seems that, while an EFT 
at low energies should exist for small enough lattice spacing, it will not be of much practical 
use. It turns out, however, that this is not true. 

The solution to this apparent roadblock consists of first considering a "low-energy" con- 
tinuum EFT of quarks and gluons, where "low energy" means that we want to consider 
correlation functions of quarks and gluons at momenta Aqcd < |) < Since at such 
momenta the degrees of freedom in this EFT are the same as in the lattice theory, no pow- 
ers of Aqcd can appear in this EFT, and any operator of order a n (n > 0) thus has to be 
multiplied by an operator of mass dimension 4 + n, so that the full lagrangian always has 
dimension four. This means that, for small values of n the operators that can appear in this 
EFT are very constrained, and, as we will see, this allows us to also restrict the form of the 
operators for small values of n in the chiral theory. The idea of considering the quark-gluon 
continuum EFT for the lattice theory is due to Symanzik [231 ]. and we will refer to this EFT 
as the Symanzik effective theory (SET). The insight that this can be used to develop ChPT 



At very small m res iduaV, see David Kaplan's lectures at this school. 
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including scaling violations systematically is due to Sharpe and Singleton 
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A. Symanzik effective theory 

Let us begin with a brief review of the construction of the SET for lattice QCD with 
Wilson fermions. 25 The Symanzik expansion is an expansion in powers of a, 

Cset = 4 4) + aCf + a 2 Cf + ... . (3.2) 

The first term, Cg, is the continuum theory, 

= Zgiuons + qflq + qMq , (3.3) 

with M the quark mass matrix given in Eq. (I2.3p . 26 In fact, because of the lack of chiral 
symmetry of Wilson fermions, the first term in the Symanzik expansion is a term of di- 
mension three of the form (c/a)qq with c a numerical constant. The quark masses in M 
thus have to be defined as the difference between the lattice quark masses and the power 
divergence c/a: 

rrii — Zg (mo,j — c)/a , i — u,d,s, (3.4) 

where mo,i is the bare lattice mass for flavor % in lattice units, and Z$ is the multiplicative 
renormalization factor needed to relate lattice masses to the continuum regulator that we 
use to define the SET. It is conceptually easiest to think of the continuum regulator as 
dimensional regularization (in some minimal subtraction scheme). This brings us to an 
important point: the fact that, for momenta Aqcd p <C an effective theory of 

the form flTTTAl) can be used to represent the lattice theory was only argued to exist using 
perturbation theory. 27 It is an assumption, essentially based on locality, that the SET is 
also valid nonperturbatively: the claim is that discretization effects in correlation functions 
with all momenta Aqcd *C P ^ V a are given by insertions of local operators, to any finite 
order in an expansion in ap. We note that this is nothing else than the basic assumption 
underlying the construction of all EFTs, if we replace ap by the ratio of scales that provides 
the small parameter for the EFT. 

Of course, in order to "think" about the SET outside perturbation theory, one would 
have to use a nonperturbative regulator — such as the lattice! One can imagine using a 
lattice regulator to define the SET with a lattice spacing a' much smaller than a, so that 
p <C 1/a <C 1/a', and since a' <C a one does not have to include powers of a' in the 
expansion (13. 2p . This ignores the possibility of power divergences in the smaller lattice 
spacing of the form 1/a' or 1/a' 2 , which one might have to subtract in order to define the 
SET. Since I will only use the form of the SET, without ever doing any calculations with it, 
I will sidestep this issue, and assume that we can work with the expansion (13. 21) . 28 



25 For a more detailed discussion of Symanzik effective theories in the context of improvement, see the 
lectures of Peter Weisz at this school. 

26 For the application of these ideas to twisted-mass QCD, se e th e review of Ref. 

17 And, in the original work by Symanzik, only for 4 theory 23 1 and the two-dimensional nonlinear sigma 



25], although there is little doubt that the ideas carry over to gauge theories as well. 



model 

28 Since we are considering a renormalizable theory, there should only be a finite number of power divergences 
in a'. For more discussion of related issues in the context of staggered fermions, see Ref. EF 
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Next, let us consider Cg. This consists of all dimension five operators consistent with 
the exact symmetries of the underlying lattice theory. For Wilson fermions 

Cf = h qia^G^q + b 2 qD^D^q + b 3 qMJpq + b 4 qM 2 q + b 5 tv(M)C gluona , (3.5) 

in which the b{ are real dimensionless coefficients, independent of quark masses, that can 
be calculated in perturbation theory, or, nonperturbatively, on the lattice. G^ u is the gluon 
field-strength tensor. Note that the bi are not really constant: they depend on logarithms 
of the ratio of the scales used to define the lattice theory and the SET. If dimensional 
regularization is used for the SET, bi = bi(g 2 (fi), log a//). We will ignore this additional 
logarithmic dependence on a, as it should be much milder than the explicit power dependence 
on a in Eq. < K2\f . 

Since the 63 and 65 terms are of the form quark mass times terms which already occur 
in Cg\ they will translate into 0(p e ) terms in ChPT. The counting here works as follows. 
Terms in translate into 0(p 2 ) terms in ChPT, cf. Eq. (I2.13p . The explicit factors of 
quark mass and a add two powers of p 2 , since we chose a power counting scheme in which 
p 2 ~ m quar kAQCD ~ a ^%cD- Since here we will work to order p A , we may drop these terms 
from consideration. The same is true for the 64 term, which is of order a(m quar k) 2 . 

At the next order in the Symanzik expansion, the number of terms of dimension six, 



consistent with all exact lattice symmetries, proliferates, and I will not list them all [27 
There are purely gluonic operators, that do not break chiral symmetry, but they may break 
Lorentz (or, rather, euclidean) invariance, such as 

^(D^G^D^G^) , (3.6) 

where for clarity I showed index sums explicitly. While this term would thus lead to new 
terms in the chiral lagrangian, it is easy to see that these are higher order than p A . As we 
already noted before, in order to construct a chiral operator that breaks euclidean invariance, 
one needs at least four derivatives. With the additional two powers of a that multiply terms 
in £5 , this makes such terms 0(p s ) in our power counting. 29 Note that this observation 
hinges on the fact that in pion ChPT the derivative is the only object with a Lorentz index. 
For instance, in baryon ChPT this is not true (there is also the four-velocity of the heavy 
baryon, which is order one in power counting), and the effects of the breaking of continuous 
rotational symmetry already shows up at order p A (28[. 

Operators involving quarks are of two types: bilinear and quartic in the quark fields. 
In order to generate new terms in the chiral lagrangian, they have to break one of the 
symmetries of the continuum theory. There are bilinears which break euclidean invariance, 
which, as we argued above, we do not have to consider at order p A . Dimension-six operators 
that break chiral symmetry (and are not of the form tr(M)£c^) are, in the basis of Ref. 29 



Of = (gg) 2 , Og = (qt r q) 2 , (3.7) 
Of = {qi,q) 2 , Off = {qt rl ,qf , 
Oq ] = (qcr^q) 2 , { S = {qUa^q) 2 , 



where t r are the SU(3) co i or generators. 



Such terms would be 0(p 6 ) in a power counting in which m quar k ~ a 2 Ag C£) . 
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B. Transition to the chiral theory 



Now we are ready to make the transition to the chiral theory. First, consider terms 
of order a, corresponding to Cg in Eq. ( 13. 2p . Using a trick similar to the one we used 
translating quark-mass terms to ChPT, we introduce a spurion field A, and write the first 
term in Eq. (I3.5P as 

abi qia^G^q -> b x (q L io -^G ^Aq R + q R ia^ v G^ v A ] qLj , (3.8) 

which is invariant under SU(3)l x SU (3) r if we let A transform as (cf. Eqs. ( 12. 5 p and ( I2.32[) ) 

A U L AU R . (3.9) 

Note that we recover the term on the left of Eq. ( 13. 8 p by setting A = al. 

We see that the b\ term, from the point of view of chiral symmetry, has exactly the same 
structure as the quark mass term. At lowest order in ChPT, we thus find a new term at 
leading order in the chiral lagrangian of the form 

- ^f 2 W tr(A f S + S f A) , (3.10) 

in which Wo is a new LEC analogous to Bq. The 62 term in Eq. ( 13.51) has exactly the same 
chiral structure, and thus does not lead to any new operator in ChPT. This is an example 
of the fact that there is in general not a one-to-one correspondence between operators in the 
SET and in ChPT. 

Once we set A = al, the new term can actually be completely absorbed into the mass 
term in Eq. ( I2.33p . if we shift the quark mass matrix by 

M — > M' = M + — al , (3.11) 
Bo 

or rrii — > m i = mi + , 1 — u,d,s . 

Bo 

If we determine, nonperturbatively, the lattice quark masses by requiring the pion mass to 
vanish, 30 then this determines M', rather than M — in other words, the shift of order a in 
Eq. (13. lip is automatically taken into account in the determination of the critical quark mass 
for Wilson fermions. Note that this does not remove all 0(a) effects from the theory when 
we extend it to include other hadrons, 31 or operators other than the action, cf. Sec. IIII El 

Next, we consider terms of order ap 2 ~ am quar k ~ a 2 , all of order p 4 in our power counting 
(appropriate powers of Aqcd are implicit). These come from two sources: higher order terms 
in the spurion A, as well as terms arising from However, both of these lead to the same 
new 0(p 4 ) terms in the chiral lagrangian. The point is that the same spurion field can be 
used to also make the operators in Eq. (13.71) invariant under SU(3)l X SU(3)r: 

a 2 {qq) 2 (q L Aq R + q R ^q L ) 2 , (3.12) 



30 Or the PC AC quark mass to go to zero. 

31 For instance, the nucleon mass has O(a) corrections which are not removed by the shift p. Ill) [28l. l30| 
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and similar for the other operators in Eq. (13. 7p . All these operators can thus be made 
invariant using the "composite" spurions 

A ® A ->■ U L AU R ® U L AU R , (3.13) 
A ® A ] -> [/ lAU r ® U R A^Ul , 
A f ® A -> E^f^ ® U L AU R , 
A ] ®A ] -> E/hAM ® UrA^U^ ■ 

Using these spurions, we find the following new terms in the chiral lagrangian 

tr(AE f ) tr(AE f ) + h.c. -> a 2 (tr(E f )) 2 + h.c. , (3.14) 

tr^EUS*) + h.c. -> a 2 tr^E*) + h.c. , 
tr(AE f ) tr(A f E) -> a 2 tr(E f ) tr(E) . 

Setting sources £^ and r M equal to zero, 32 the complete set of additional terms in the chiral 
lagrangian at order p A , including new LECs multiplying each of the new operators is 29l 



A£ (4) = aW 4 tr(9 M E9 M E f ) tr(E + E f ) + aW 5 tr (^E^E^E + E f )) (3.15) 

-aW 6 tr(ME f + EM) tr(E + E f ) - aW 7 tr(ME f - EM) tr(E - E f ) 
-aW 8 tr(ME t E t + EEM) 

-a 2 W 6 (tr(E + E f )) 2 - a 2 ^ (tr(E - E f )) 2 - a 2 W'% tr(EE + E f E f ) , 

in which 

M = 2B M , a = 2W a . (3.16) 

If one considers (nonperturbatively) 0(a) improved QCD with Wilson fermions, one can use 
the same chiral lagrangian. It simply means that Wq = in Eq. (I3.10p . Note that this does 
not mean that we should set all terms of order a 2 equal to zero, despite our convention to 
include a factor of Wo in the definition of a! The reason is the following. If the underlying 
theory is 0(a) improved, all terms of order a in the SET (cf. Eq. (13. 5p ) vanish, and there 
is no need to introduce the spurion A at all at this order. However, this does not mean 
that there are no 0(a 2 ) terms in the SET, so we do need to introduce 0(a 2 ) spurions as 
in Eq. (I3.12p . It just so happens that we can "recycle" the same spurion A at order a 2 
by taking tensor products as in Eq. (I3.13p . If underlying lattice theory is not improved, 
there are also contributions at order a 2 proportional to the 0(a) coefficients of Eq. (13.51) . in 
addition to those proportional to appearing in the SET at order a 2 . At the level of AC^ 
both types of contribution at order a 2 take the same form, given in Eq. (I3.15p . 

As an application, let us consider the modifications to the 0(p 4 ) result for the pion mass, 
cf. Eq. (12.401) [29[. First, the overall factor fhg gets replaced by rfi£ + a = 2B m' e , with m' e 
the shifted light quark masses of Eq. (13.111) . and likewise, the logarithms get replaced by 

L(ml) ->• L(2B m e + 2W a) = L(2B Q m' t ) , (3.17) 

etc. Both of these modifications are a consequence of the fact that the 0(a) term (I3.10P can 
be absorbed into the shifted quark masses of Eq. (13.111) . In this case, as we already saw 



:S2 



For the case including all sources and new "high-energy" constants, see Ref. [31 1. 



23 



above, the quark masses that one determines in a numerical computation are the shifted m! 
masses, so the 0(a) shift in Eq. (I3.17P is not observable. 

Second, the terms proportional to the L»'s in Eq. (I2.40p also change. Not surprisingly, in- 
stead of only terms proportional to Lim 2 uark , one now also finds terms of the form Wiam quar k 
and W[a 2 . At order a, the chiral lagrangian only "knows" about the shifted quark masses 
m\ of Eq. (13. lip , as the 0(a) operator (I3.10P is absorbed into the quark masses before tuning 
them to be near the critical value. The same mechanism does not work at order a 2 . One 
may thus imagine working with quark masses so small that the shifted light quark mass m' e 
becomes of order a 2 AQ CD . This would imply that the power counting has to be modified; 
for instance, the W$ 7 8 terms in Eq. ( 13 . 15[) now become leading order. This is the so-called 
LCE (large cutoff effects) regime; the regime in which mi ~ m' e ~ ah 2 Q CD is often referred 
to as the GSM (generic small (quark) mass) regime. 33 So far, our discussion has been in the 
GSM regime. Before we discuss some of the ChPT results in the LCE regime, however, we 
consider the phase diagram of the theory, as it turns out that the competition between the 
0(m' e ) terms and 0(a 2 ) terms in the chiral lagrangian can lead to various nontrivial phase 
structures, depending on the sign of an 0(a 2 ) LEG 



C. Phase diagram 

So far, we have been expanding £ around the trivial vacuum in order to calculate various 
physical quantities, such as in Eq. (I2.4(jp . Indeed, with x — 2B M with M as in Eq. (12. 3p 
with all quark masses positive, E = 1 corresponds to the minimum of the classical potential 
energy, given by the second term in Eq. (12 .33 p . For a = 0, higher order terms in the chiral 
expansion of the potential are smaller, and thus do not change this observation. 

When we include scaling violations, there are two small parameters, m quar k and aAQ CD . 
In the previous section, we took these two parameters to be of the same order, and again 
the leading order nonderivative terms in Eqs. (I2.33P and (I3.10p constitute the leading-order 
classical potential. As long as the shifted quark mass of Eq. (13. lip stays positive and in 
the GSM regime, the vacuum remains trivial. However, if we now make the shifted quark 
mass smaller, so that at some point m' quark ~ a 2 Ag CD , the terms with LECs Wq 78 lYl 
Eq. (I3.15P become comparable in size, and should be taken as part of the leading-order 
classical potential that determines the vacuum structure of the theory. This is the regime 
that we denoted as the LCE regime at the end of Sec. IIII Bl [24| (see also Ref. (32[). 

The strange quark mass is much larger than the up or down quark masses, so it is 
interesting to consider the case that mi = m u = m^ <C m s , with mi ~ a 2 K 2 Q CDl while m s 
stays in the GSM regime. 34 We then expect the vacuum value of £ to take the form 




(3.18) 



in which S 2 denotes an SU(2) valued matrix. The interesting phase structure is thus essen- 
tially that of a two-flavor theory, and in the rest of this subsection we will analyze the phase 
structure for Nf = 2. For Nf = 2, the classical potential, which consists of the second term 



The LCE regime is also commonly referred to as the Aoki regime. 

I will drop the primes on the shifted quark masses in the rest of this section. 
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in Eq. fl2T33|) . Eq. (ETTQj) . and the W^ 78 terms in Eq. (T37T5]) . becomes 

1 1 2 

= --/ 2 m £ tr(£ 2 + £ 2 ) - + -Wg') (tr(S 2 + S 2 )) + constant , (3.19) 

where we have used the SU(2) relation 

tr(S 2 S 2 + S 2 S 2 ) = l - (tr(E 2 + S 2 )) 2 - 4 . (3.20) 

All other terms in the chiral potential are of higher order. 

This potential exhibits an interesting phase structure. First, consider the case that W = 
32 (We + \W%){2W ) 2 / f 2 > 0. In that case, the potential is minimized for 

*={-!: £<2 < H '"> )- < 3 - 21 ) 

The pion mass is given (to leading order) by 



m 



TT,LO 



2\m £ \+2a 2 W (3.22) 



(where LO remind us that this is the pion mass to leading order), and attains a nonzero 
minimum value at rhi = 0. The phase transition is first order, with the vacuum expectation 
value S 2 exhibiting a discontinuous jump across rhi = 0. 

Equation ( 13. 22ft suggests what will happen for W < 0. 35 The pion mass-squared turns 
negative when \rh(\ < a 2 \W\, which signals the spontaneous breakdown of a symmetry. 
Indeed, writing 

E 2 = a + ir -7? , a 2 + vf 2 = 1 (3.23) 
(with t the Pauli matrices), we find that the potential is minimized for 

+1, m e >a 2 \W'\ 

, \m t \ <a 2 \W'\ (W'<0) . (3.24) 
-1 , m e < -a 2 \W'\ 

A phase of width a 3 in lattice units opens up, in which isospin and parity are spontaneously 



a 



broken, because of the formation of a pion condensate [24|. We can take the condensate in 
the 7r 3 = 7r direction in isospin space, in which case the charged pions become massless, 
becoming the exact Goldstone bosons for the breakdown of isospin. Since the condensate 
now varies continuously across the phase transition, this is a second order transition. The 



possibility of such a phase was first noticed by Aoki [35], and provides a possible mechanism 
for pions to become exactly massless with Wilson fermions at nonzero lattice spacing by 
tuning the quark mass. At the phase transition the charged pions are still massless, and 
because of isospin symmetry at that point, the neutral pion also has to be massless at the 
transition. Note however, that if W > 0, pions will not be massless at the phase transition 
for nonzero a. 

There is a lot more than can be said about the phase structure. We will not do so in 
these lectures, but end this section with a few comments: 



:$5 



For W = higher order terms in the potential would have to be considered 33 1. 
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1. An important observation is that the chiral lagrangian does not only provide us with a 
systematic expansion in the small infrared scales of the theory (the quark masses and 
aAq CD ), but it also provides nonperturbative information about the phase structure 
of the theory. 

2. In two-flavor QCD with Wilson fermions, one may consider a more general quark-mass 
matrix 

M = m e l + ifir 3 . (3.25) 

This leads to tmQCD, Wilson- fermion QCD with a twisted mass. In the continuum, 
one can perform a chiral rotation to make the mass matrix equal to M = y/m 2 + J?l, 
but with Wilson fermions there is no chiral symmetry, and the theory for \x ^ differs 
from the one with \i = 0. We note that for p, ^ this generalized mass matrix breaks 
isospin symmetry explicitly, and \x thus serves as a "magnetic field" pointing the 
condensate in the T3 direction in the limit // — > (after the thermodynamic limit has 
been taken). It is straightforward to develop ChPT for this case; one simply substitutes 
this mass matrix into the (two-flavor) chiral lagrangian. For detailed discussions of 
tmQCD, see for instance the reviews in Refs. HE!- 



D. The LCE regime 

We have seen that 0(a) effects, while present for (unimproved) Wilson fermions, can 
be completely absorbed into a shifted quark mass in the pion chiral lagrangian. While we 
already considered 0(a 2 ) effects in the pion masses in the GSM regime toward the end of 
Sec. lIIIBl it is interesting to consider effects from 0(a 2 ) scaling violations in the LCE regime, 
where m' e ~ a 2 Aq CD . We will always work with m' e , cf. Eq. (13.111) . and in this section we 
will again restrict ourselves to Nf = 2, thinking of the strange quark as heavy. This makes 
sense, as we may expect that numerical computations with very small light quark masses, 
which may then indeed turn out to be of order a 2 Aq CD , will become more prominent in the 



near future. The results we consider in this section have been obtained in Ref. [36|. 36 They 
are valid in the phase with unbroken isospin and parity. In other words, we will choose m' e 
such that m 2 LO of Eq. (I3.22p is always nonnegative. 

The first thing to note is that, since our power counting changes relative to the GSM 
regime, new terms need to be added to the chiral lagrangian if one wishes to work to order 



4 

p . 



LCE regime: 0(p 2 ) 
0(p 3 ) 
0(p A ) 



p ,m' e ,a , (3.26) 



ap 2 , am' £ , a 3 , 



4 2// t\2 2 2 2 / 

p , p m e , (m e ) ,ap,a m e , a 



Since we will be interested in the nonanalytic terms at one loop, which follow from the 
lowest-order chiral lagrangian, I just refer to Ref. [36] for the explicit form for all new terms 



not present in Eq. (j3.15p . which does already include all 0(p 2 ) terms. 



■Mi 



For earlier work, see Ref. 137 
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One finds for the pion mass to order p 4 



36 



< = <Lo{l + [(Klo - 10a 2 miog (^f)) (3-27) 

+ analytic terms proportional to m 2 LO and a j . 

Here rh\ LO is equal to m 2 LO (which is given in Eq. ( 13.221) ) up to 0(a 3 ) and 0(a 4 ) terms 
that have been absorbed into the definition of the quark mass. This corresponds to defining 
the critical quark mass as the value where the pion mass vanishes, but one should keep in 
mind that these scaling violations are not universal. 

This result is interesting because it shows that the continuum value of the coefficient of 
the chiral logarithm, which is a prediction of ChPT, can be significantly modified by scaling 
violations in the LCE regime. It also shows that at nonzero lattice spacing the chiral limit is 
unphysical: for rh 2 LO — > the chiral logarithm diverges, unless we take the lattice spacing 
to zero as well, at a rate not slower than the square root of the quark mass. Note however, 
that if W > 0, there is a nonvanishing minimum value of the pion mass which is of order 
a 2 , cf. Eq. f)3.22p . so that this divergence cannot occur. In the case W < the divergence 
can happen, and the only way to make mathematical sense of the result (13.271) is to resum 
higher powers of a 2 log(m 2 LO ). 37 Similar effects also show up in pion scattering lengths 



36[. 38 Because of the absence of chiral symmetry at nonzero a, pion interactions do not 
vanish for vanishing external momenta; the leading term is of order a 2 , and divergent chiral 



logarithms occur at order a 4 . Note that the results of Ref. 36j are obtained approaching the 
phase transition from the phase in which isospin is unbroken. If, for W < 0, one calculates 
scattering lengths approaching the phase transition from the other side, one would expect 
them to vanish. The reason is that on that side isospin is broken, and two of the three 
pions are thus genuine Goldstone bosons at nonzero a, with vanishing scattering lengths. 
Since isospin is restored at the phase transition, all pion scattering lengths should thus 
vanish approaching the phase transition from the phase with broken isospin. This is not 
a contradiction: scattering lengths are discontinuous across the phase transition between a 
symmetric and a broken phase. 

An important general lesson of this section is that Wilson ChPT allows us to choose 
quark masses so small that m 2 ~ a 2 AQ CD , i.e. , in a regime where physical and unphysical 
effects compete. In contrast, if we only had continuum ChPT as a tool, we would have 
to first extrapolate to the continuum before doing any chiral fits. However, the examples 
we have discussed make it clear that, in general, we would not want to take the chiral and 
continuum limits "in the wrong order": if one first takes the physical infrared scale m, 
to zero at fixed nonzero value of the unphysical infrared scale aAg CD , ChPT tells us that 
infrared divergences may occur, as we have seen above. 

We end this section by observing that the result for the GSM regime to 0(p 4 ) can be 
recovered from Eq. (I3.27P by expanding in a 2 and dropping all terms of order a 3 , a 2 m' e and 



37 It is not clear to me that this would lead to a less divergent result after resummation for all physical 
quantities in which such enhanced logarithms may occur, as it turns out to be the case for the pion mass 



37 1 



:s.s 



This reference contains an interesting idea on how to determine the value of W from the 1 — 2 pion 
scattering length. 
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a 4 . 



E. Axial current 

Before leaving Wilson fermions for staggered fermions, let us briefly look at currents in 
the presence of scaling violations; as an example I will consider the nonsinglet axial current, 
and we will work to order a, i.e. , we are back in the GSM regime. I will highlight some of 



the arguments presented in Ref. [38j|, which contains references to earlier work, and which 
also discusses the vector current. 

The issue is that for a 7^ 0, the axial current for QCD with Wilson fermions is not 
conserved. Since there is no Noether current at the lattice QCD level, there isn't one at 
the SET level, nor in ChPT. Since there is no conserved current, the lattice current that is 
usually employed is the local current 39 

A;(x) = q(x) lfll5 T a q(x) , (3.28) 

in which the T a are SU(3) or SU(2) flavor generators. First, this current needs a finite 
renormalization, in order to match to the continuum-regularized (partially) conserved axial 
current 



39 



A^ ren = Z A A; , (3.29) 

with Z A a renormalization constant determined nonperturbatively by enforcing Ward iden- 
tities. Since Z A is determined nonperturbatively, it includes scaling violations. Then, since 
A ^ is not a Noether current following from the lattice action, the effective current that rep- 
resents the axial current in the SET also does not follow from the SET action; instead, it is 
just an "external" operator for which we have to find the corresponding expression in the 
SET. To order a, the "Symanzik" current that represents the lattice current A ^ in the SET 
is 40 

K,set = (1 + b A am) (^ cont + ac A d,P? ont ) , (3.30) 
Accent = ll,T a q , P a cont = q lb T a q , 

in which h A and c A are Symanzik coefficients that depend on the underlying lattice ac- 
tion; "cont" indicates continuum operators. (In a fully 0(a) improved theory both b A and 
c A vanish.) By combining Eqs. (13.29j) and (I3.30p . we see that the overall multiplicative 
renormalization factor is (1 + b A am)Z A /Z A . Z\ is the all-orders perturbative matching fac- 
tor needed to convert a lattice current into the properly normalized (partially conserved) 
continuum current; we have that Z A = Z A + 0(a). 41 

At this stage, two steps need to be carried out to translate this current to ChPT. First, 
the operator A < ^ coni . + ac A d ll P^ oni . needs to be mapped into ChPT. Then, the renormalization 
factor following from Eqs. (I3.29P and ( I3.30P has to be determined in exactly the same way 
as the lattice current is matched to the renormalized continuum current in an actual lattice 
computation. This matching has to be done to order a as well, if one consistently wants to 



39 Or an improved version of the local current. 

40 See the lectures by Peter Weisz at this school. 

41 Only the ratio Z A /Z A appears when we combine Eqs. (|3.29p and (|3.30[) . 
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include all 0(a) corrections in the chiral theory. Since this involves the matching of scaling 
violations, one does not expect the outcome of this second step to be universal, and indeed, 
Ref. |38| finds that already at order a the matching depends on the precise correlation 
functions used for the matching, with a nonuniversal result for the ratio Z A /Z° A \ I refer to 
Ref. [38[ for further discussion of this second step. Here, I will address the first step: the 
mapping to operators in the chiral theory, to order a. 42 



In order to find the ChPT expression for the axial current, Ref. [38| first observes that, in 
the case that ca = 0, 1>a — and Z A = 1, the Symanzik current is just the continuum axial 
current, which can be rotated into the vector continuum current by a chiral transformation. 
This fixes the ca = part of the axial current in ChPT in terms of the vector current. It 
turns out that the vector current is given by the Noether current of the chiral theory, because 
the 0(a) terms in the vector current are of the form ad u (qa fMU T a q), which is automatically 
conserved. 43 The Noether current following from + AC^ (cf. Eqs. (I2.33P and ( 13.1 5p ) is 

V, = (J? + J L ,) (l + ^ tr(it S + SU)) + ^ { J* + J£ , it S + E ti} > ( 3. 31) 

with J^' L defined to lowest order in Eq. (j2.26P ; we omitted terms coming from of 

Eq. f[27T7|) . Here A = 2W A with A the spurion of Eq. ( ET9J) . We should set A = a of course, 
but only after doing the chiral rotation to find the ca = part of the axial current. We thus 
find 

^ = (^-^)(l + ^tr(S + St)) + ^{j;-^S + St} (ca = 0), (3.32) 

where now we have set A = a. 

Finally, we have to worry about the ca term in Eq. (13.301) . This is rather simple, because 
for this we need the lowest-order ChPT operator for the pseudoscalar density, which is 
— i(E — E^). We need to introduce a new LEC Wa for this contribution, since, unlike the 
contribution in Eq. ( 13.32[) . this term does not relate in any way to the Symanzik effective 
action, and the same is thus true in ChPT. We thus find for the complete expression for the 
axial current to order a: 

A, = (J? - Jt ) (l + ^ tr(E + Et) ) + *™L { J« - Jf, E + Et} (3.33) 

-UW A c A d f (T, - Et) (c A + 0) , 
where, following Ref. [38|, I kept ca outside Wa- m In closing this section, I should emphasize 



that the derivation given here is the "quick and dirty" derivation — for a careful analysis of 
both vector and axial currents, I refer to Ref. 13811 . 



F. Staggered fermions 

Before developing ChPT for staggered QCD, I give a brief review of the definition and 
symmetry properties of lattice QCD with staggered fermions, to make these lectures more 
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This step was already carried out in Ref. [40( using: 



the source method; see also Refs. 



43 There is a vector Noether current on the lattice 

44 This result can now be multiplied again by the factor (1 



39], but often the local vector current is used instead. 
b A a)/Z° A . 



29 



self-contained. 45 



1 . Brief review 

The staggered action is (for the time being I will set the lattice spacing a = I, but I will 
restore it when we get to the SET and ChPT) 

S =Y1 \ V»( x )Xi(%) \u^{x)xii.x + 11) - Ufa - n)xi{x - n)] +^2 m iXi{x)Xi{x) , 

x,fj,,i x,i 

Vlx (x) = (_i)*i+-+*m-i . (3.34) 

Here Xi is a lattice fermion field with an explicit flavor index i — 1, . . . , Nf, an implicit color 
index, and no Dirac index; U^{x) are the gauge-field links, with values in SU(3) co i or . 

The action (13.341) has species doubling, and because of the hypercubic structure of the 
theory, this doubling is sixteen-fold. The sixteen components which emerge in the continuum 
limit can be accounted for by an index pair a, a, with each index running over the values 
1, . . . ,4. It turns out that the first index can be interpreted as a Dirac index, while the 



second index constitutes an additional flavor label; the continuum limit looks like [42H44 



S- = /^(? i -7^ w . + (3.35) 

in which repeated indices are summed. In brief, this structure emerges as follows. Species 
doublers live near the corners 

7T A G {(0, 0, 0, 0), (vr, 0, 0, 0), . . . , (vr, vr, vr, vr)} , A = 1, . . . , 16 , (3.36) 

in the Brillouin zone. It thus makes sense to split the lattice momenta p into 
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p = 7i A + k , -- < k„ < - . (3.37) 



We thus identify sixteen different fermion fields in momentum space, x{p) = xi^A + k) = 
XA{k), which represent the sixteen doublers. Since the phases rj^(x) insert momenta with 
values in the set (13.361) . it follows that the operation 

T ±fl : Xi(x) -)■ Vv(x)xi(x ± fi) (3.38) 

mixes the sixteen doublers. In momentum space, after a basis transformation, the operations 
Tjj can be represented by 

T ±ll : q iaa {k) ->• 7^e ±ik ^q i(3a (k) , (3.39) 



with the 7 M a set of Dirac matrices. That these four matrices satisfy a Dirac algebra follows 

nmute: 

T^T V = -T V T^ , n ^ v , (3.40) 



from the fact that the T M anticommute: 
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For more introduction, see the lectures by Pilar Hernandez at this school, as well as the recent review by 
the MILC collaboration [41 [. 
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and that 

7%: Xi(x)->Xi(^±2Ai). (3.41) 

Using these ingredients, we can see how Eq. f l3.35|) follows from Eq. (I3.34p . Using Eqs. ( 13 .38j) 
and (I3.39p . the free kinetic term in Eq. (I3.34p can be written as (for one flavor) 

J2\x(x) (T, - T^) X (x) = J -0^ E9(*)7m<™(*m)?(*) • ( 3 - 42 ) 

In the continuum limit, sin(A; /Lt ) — > k^; note that k is restricted to the reduced Brillouin 
zone (c/. Eq. (I3.37P ). on which no species doublers reside. Finally, since Eq. (I3.34p is gauge 
invariant, the continuum limit f !3.35|) has to be gauge invariant as well. It follows that, at 
least classically, Eq. ( I3.35|) is the continuum limit of Eq. (I3.34p . 

The observation that the continuum limit of staggered QCD is given by Eq. (I3.35P has 
been proven to one loop in perturbation theory, and this result has been understood in terms 
of the exact lattice symmetries, making it likely that this proof can be extended to all orders 



43J. There is also extensive numerical evidence [41|. We conclude that the lattice theory 
defined by Eq. (I3.34p constitutes a regularization of QCD with ANf flavors. In modern 
parlance, "flavor" is now usually used to denote the explicit multiplicity associated with 
the flavor index i, while "taste" is used to denote the implicit multiplicity associated with 
species doubling. If indeed the continuum limit is given by Eq. (I3.35p . this implies that a 
full U (4) taste symmetry emerges in the continuum limit, for each staggered flavor. This taste 
symmetry is not present in the lattice theory — we will return to this shortly. 

We note that the taste degeneracy is somewhat of an embarrassment, if one wishes to 
use the flavor index i as physical flavor, as is usually done in practice. While this is not 
a problem for valence quarks (one simply picks out, say, taste a = 1 in order to construct 
operators, for example), this means that there are too many sea quarks — too many quarks 
on the internal fermion loops. This problem is solved in practice by taking the (positive) 
fourth root of the staggered determinant for each physical flavor. While this makes sense 
intuitively, it is a serious modification of the theory, which entails a violation of locality 



and unitarity at nonzero lattice spacing [45|. Much work has recently gone into attempts 
to show that the procedure is nevertheless correct in the sense that it yields the desired 
continuum limit, in which EFT techniques in fact play an important role. There is no space 
here to discuss this very interesting (and important!) topic; I refer to the recent reviews 



of Refs. [411 . I46l |. Here I will focus on the construction of staggered ChPT for unrooted 
staggered fermions. 

To construct the SET for staggered QCD it is important to use all exact symmetries 
of the lattice theory, in order to restrict the form of the operators that will appear in the 
Symanzik expansion, Eq. (13. 2p . The action (I3.34p has an exact vectorlike SU(Nf) symmetry, 
broken only if the quark masses m; are nondegenerate. Furthermore, there are spacetime 
symmetries. In addition to hypercubic rotations and parity, the action is invariant under 



shift symmetry [43J, |47 



S±, : Ux) Xi (x ± (J) , Ux) = (_i)*m+i+-+-4 ; (3 43) 

S±n '■ U v (x) ->■ U v (x ± £i) , 

because of the fact that 

C u (x)i]p(x)Cv(x +fi) = rjn( x + v ) ■ ( 3 -44) 
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On the basis qi aa shift symmetry is represented by 

Qiaa 

(k) -> ^, ab e ±lk -q tab (k) , (3.45) 

with £ M a different set of Dirac matrices (like Eq. (I3.40p we have that S^S U = —SuS^ for 
[i^v) acting on the taste index, commuting with the 7„ of Eq. (13. 39 p . because of Eq. (13.441) . 
Finally, if rrii = for a certain flavor, there is a chiral symmetry ( u U(l) e symmetry ) |48j] 

Xi{x) e ld ^ Xl (x) , Xi(x) e^xM , (3.46) 
e(x) = (_i)^i+^+^+^4 _ 

Since this symmetry is broken by a nonzero rrii, it should be interpreted as an axial symmetry 
of the theory. In addition, since it is an exact symmetry of the lattice theory (for m ; = 0) 
it has to correspond to a nonsinglet axial symmetry in the continuum limit, because the 
continuum singlet axial symmetry is anomalous. If all quarks are massless, it is not difficult 
to show that this enlarges to an exact U(Nf)L x U(Nf)n symmetry. 46 The axial symmetries 
in this group are nonsinglet symmetries on the basis qw, because the phase e(x) inserts 
momentum, and thus acts nontrivially on the index pair a, a: Since 

e(x)xi(x) = YlT-pSnXii?) , (3.47) 

it follows that U(l) e symmetry is represented on the q aa basis as 

qiaa^U 9 ^ 5 ) a 0b . (3.48) 

In staggered QCD there is extensive evidence that these axial symmetries are sponta- 
neously broken, and for each such broken symmetry there is thus an exact Nambu-Goldstone 
boson. When rrii is turned on, this NG boson picks up a mass-squared proportional to m,;, as 
we will see below. Contrary to the case of Wilson fermions, we thus know the critical values 
of the quark masses rrii — the massless limit is at m ; = 0. Interpolating fields for these 
Nambu-Goldstone bosons are t(x)Xii x )Xj{ x ) i n position space, or q^^Qj i n momentum 
space. 

As can be seen from Eq. (I3.35j) . these lattice symmetries enlarge to SU (4iV/) 1 x SU(ANf) R 
(and the usual euclidean spacetime symmetries) in the continuum limit. In the contin- 
uum limit, one thus expects (4iV/) 2 — 1 pions, associated with the spontaneous breakdown 
SU(AN f ) L x SU(AN } ) R -> SU(AN f ) v . At nonzero a, there are only Nj exact Nambu- 
Goldstone bosons; the rest will pick up additional contributions to their mass-squared of 
order a 2 , as we will see in more detail below. These lattice- artifact mass splittings are usu- 
ally referred to as "taste splittings," since they occur because of the breakdown of U(A) taste 
on the lattice. Indeed, this taste breaking is seen on the lattice very clearly in the meson 
spectrum, which is shown in Fig. El Clearly, we would like to understand these results with 
help of a ChPT framework. 
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U (Nf) instead of SU(Nf), because, as we just noted, the axial symmetries in this group are all nonsinglet! 
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FIG. 3: (a) Taste splittings between pions made out of an up and (anti)down staggered quarks, in 
different representations of the staggered symmetry group, at fixed lattice spacing, a = 0.125 fm. 
The black points correspond to the exact Nambu-Goldstone boson, lines are chiral fits using 
staggered ChPT. (b) Taste splittings between the various nonexact Nambu-Goldstone bosons and 
the exact Namubu-Goldstone boson as a function of lattice spacing. These figures show the 
behavior predicted by Eq. (|3.63[) . From Ref. [Isj], to which I refer for a detailed explanation. 



2. Staggered ChPT 

The SET is a continuum theory. This means that it is invariant under translations over 
an arbirary distance; in particular, it is invariant under translations over a distance a in 
any direction. 47 It is also invariant under any of the exact lattice symmetries. We may 
thus combine a shift in the positive [i direction by a translation over a distance a in the 



negative \x direction. Combining these two, we see that the SET is invariant under [26|, |49 



Qi(k) -> ZMk) . (3.49) 

At the level of the SET, the "translation" and "taste" parts of shift symmetry thus decouple, 
and we find that the SET thus has to be invariant under the 32-element group T 4 of discrete 
taste transformations generated by £ M , which is a subgroup of U(A) taste . A& We can thus use 
this group in order to restrict operators at each order in the Symanzik expansion, along with 
the other exact lattice symmetries. 

The first term in the Symanzik expansion, Cg , is given by (the integrand of) Eq. (I3.35p . 
There are no dimension-five operators that can be constructed from the quark fields q and 



47 In contrast, staggered QCD is only invariant under ordinary translations over a distance la in any of the 
principal lattice directions. 

48 Note that this group does not become larger due to the presence of Nf flavors, because the gluon field also 
transforms nontrivially under shifts, cf. Eq. (I3.43[) . Thus all quark flavors must be shifted together. But 
because C/ M shifts by a pure translation, the analogue of Eq. (|3.49p for gluons is trivial: U^p) — > U^p). 
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FIG. 4: Generation of taste-breaking four-fermion vertices through short-distance gluon exchange. 



q that respect the lattice symmetries [50(]. Ignoring U(l) e symmetry, the possible dimension 
five operators are those of Eq. (13.51) . The first two operators in this list are immediately 
excluded by Eq. ( 13.48ft . The three other terms involve powers of the quark mass, and in 
order to make use of U(l) e symmetry, we thus have make the quark mass a spurion, as in 
Eqs. (12. ip and ( 12. 5p . but we now do it with L and R projectors on the quark fields defined 
with 75^5, instead of 75. It is then not difficult to see that also the three last operators in 
Eq. (13. 5p are excluded. 

This implies that scaling violations for staggered QCD start at order a 2 . In fact, before 
we start the discussion of we should rethink power counting. If we use GSM power 
counting, all terms of order a 2 would be 0(p 4 ). However, it turns out that scaling violations 
with staggered fermions, even though formally 0(a 2 ), are numerically rather large, even 
with improved staggered actions, at current values of the lattice spacing. 49 From the taste 
splittings in pseudoscalar mesons, it seems that the contribution of the quark masses and 
0(a 2 ) scaling violations to their masses is roughly equal, and we will therefore use a power 
counting in which p 2 ~ m quar k-^QCD ~ c^-^qcd- This means that 0(a 2 ) effects are of 
leading order in ChPT, and we would like to construct at least the 0(a 2 ) part of the chiral 
lagrangian, because from the 0(p 2 ) lagrangian we can obtain the 0(p 2 ) and nonanalytic 
0(p 4 ) parts of physical quantities in terms of quark masses and the lattice spacing. We 
thus set out to find the 0(a 2 ) part of C^ 2 \ cf. Eq. (I2.33j) . This new part contains no quark 
masses, and no derivatives, each of which would make such terms higher order in the chiral 
expansion. 



The 0(a 2 ) part of the SET, contains a large number of terms 49_|, |5l|, and I will 
only give some examples here. First, there are the purely gluonic terms which we already 
discussed in the context of Wilson ChPT, and thus do not have to revisit again. Similarly, 
fermion bilinears in C^p do not lead to any new terms at the desired order in ChPT [49]. 
The remaining terms are all four-fermion operators. 

Taste-breaking, short distance four-fermion operators occur in the theory because of 



49 This state of affairs may change in the reiatively near future, with more highly improved lattice actions, 
and even smaller lattice spacings. 
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diagrams as shown in Fig. HI The gluon carries a momentum near one of the values in 
Eq. (I3.36p . with n > components near n/a, thus changing the taste of the staggered 
quarks at both vertices. For such gluon momenta, we have that the gluon propagator 



a 2 



£ M 4, sin 2 (§a P/ ,) ~ An ' (3 ' 50) 

Effectively, this generates a four-fermion operator in the SET (in which gluons with momenta 
of order 1/a have been integrated out), for instance of the form 50 

0\ = {q^suqi)(qj^u5Qj) , (3-51) 

in which —^5 = ^ u = £5^, and all repeated indices are summed. This operator is invariant 
under the group T 4 , but not under the full continuum taste group U(4) taste (it is, in fact, 
invariant under SO(4) tas te C U(4) ta ste, accidentally). Since the gluon carries no flavor, the 
flavor index structure has to be as indicated. Note that the spin <g> taste matrix appearing in 
each quark bilinear has to correspond to an odd number of applications of T M or S^, because 
gluons couple only to the kinetic term in Eq. (13.341) . which is invariant under U(l) e . 

Note that the appearance of four-fermion operators due to the exchange of "heavy" gluons 
is analogous to the appearance of four-fermion terms with coefficients of order g 2 ea fc/M^ 
(the Fermi EFT) in the Standard Model at low energy because of the exchange of the heavy 
W and Z bosons. Likewise, in the case of hand, the four-fermion operators in the SET are 
of order a s , the strong "fine-structure" constant. Light gluons, with momenta p C 1/a are 
still present in the SET, i.e. , they have not been integrated out. 

The spin matrices 7^ can also appear in these four-fermion operators. This can happen 
in two ways, here is an example of each: 



°2 = SCffiT/i&ftXffjTM&g?) > ( 3 - 52 ) 
^3 = ^(^ilu^uqij^jlviu^j) , (3.53) 



fJ.<U 



in which ^ = £ M £„ (/i ^ u), and where we now showed the sums over Lorentz indices explic- 
itly. As before, neither of these operators is invariant under U(4) t aste, and both are invariant 
under r 4 (as well as all other lattice symmetries, in particular hypercubic rotations). The 
first operator is separately invariant under SO(4) euc udean x SO(4) taste , the second is not. 
Operators of the second type can only be represented in ChPT with at least two derivatives 
(and one factor a 2 ) [52j, which are of order p 4 . 51 Only operators of the first type have 0(p 2 ) 
representations in ChPT. An interesting observation is thus that at lowest order staggered 



ChPT therefore has SO(A) taste symmetry, and not just the required T 4 symmetry [49 



Four-fermion operators can be color unmixed or mixed. For instance, Eq. (|3.51l) can take the form 
(Sia^5vQia,)(MjbCu5Qjb) or (Mia^5vQib)(Sjb^v5ilja}, where a and b are color indices. We see that color indices 
can be contracted in two different ways. They will be omitted here, because it makes no difference for our 
analysis. 

Now both derivatives and £ p matrices can provide Lorentz indices, so the argument following Eq. (|3.6p 
does not apply. 
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Finally, we need to make the transition from the SET to ChPT, for operators such 
as (13.511) and (I3.52p . For Nf = 3, our theory is now an SU(12) theory, because of the three 
flavors times four tastes. The nonlinear field 

^ia,jb ~ qialAjbR > (3.54) 

is thus an SU(12) valued field, with an index structure as shown. In order to streamline our 
notation, we introduce matrices 51 



^ 

e£ 8) = ^ I . (3.55) 
so that we can write for example Eq. (I3.5ip as {q^ 3 Jq) (q^j/sl)- We now rewrite 

-(q&JqmSq) = {q R ^q L + mSqr) 2 (3-56) 
-> (qr x r(1l + q L x LqR) 2 , 

where in the last line we introduced two spurions transforming as 

X L -> U L X L U R , X R U R X R Ul , (3.57) 

while they are interchanged by parity. With the field S we can construct three types of 
terms in ChPT with these spurion fields that do not contain the quark mass matrix or any 
derivatives: 

f tr(X«£) tr(X L £t) , 
(q^ 3 Jq)(q6h) ~> (tr(X^S)) 2 + (tr(X L £t)) 2 , (3.58) 
( tr(X i? SX /? S)+tr(X L StX L St) , 

which, setting the spurions equal to the values that reproduce Oi, gives the three ChPT 
operators 

tr(dJE) tr(dJst) , (tr(d?S)) 2 + (trtfg , tr^E^E) + tr^E^t) , 

(3.59) 

in which sums over the index v are understood. 

(3) 

Introducing spurions Yl,r for the taste matrices £5 in O2, which transform as Yi iR 
Ul,rXl,rUl r, and working through a similar exercise, one finds the (unique) ChPT operator 
representing C 2 : 

Efe7,d 3) %)(^7,d\) tr(efEefEt) . ( 3 .eo) 

A 4 

Referring to Ref. jUJ for further details, we quote the 0(a 2 ) contribution to the chiral 
lagrangian, to be added to Eq. (12. 33ft . Writing this part as a 2 V = a 2 {U +W), one finds that 

-U = C 1 tr (Cf SCf St) + \C, £ [tr (tf >E£? >E) + h.c] (3.61) 

+\c, e ^ (e£?seS } s) + h.c] + c 6 e tr (egsegst) , 

-Z/ = -C 2V E [tr(^ 3) S) tr(#E) + h.c] + -C 2A E [tr^? E) tr^E) + h.c/ 



^ L-v-sw -/ --vm, -/ ■ j ■ , 

+^5v Etr(^ 3) S) tr(^ 3) St) + ^ E^dJs) tr(£$Et; 
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Let us briefly consider some of the physics results that can be obtained from the 0(p 2 ) 
chiral lagrangian, which is the sum of Eqs. f !2.33|) (for a twelve-flavor continuum theory) 
and (I3.6ip . First, consider meson masses at tree level. Here I will only quote results for fla- 
vored (i.e. , off- dia gon al in Eq. (12. 6p ) masses. The flavor-neutral sector is more complicated, 
and I refer to Ref. 5l[ for details. 



Each of the entries in Eq. ( 12 .6p is now a four-by-four taste matrix, and we can thus expand 
each entry on a set of U (4) taste generators: 

1 16 _ 

0« = 9 12 -f^H ' i'j = u ' s ' ( 3 - 62 ) 

A F=l 

By expanding £ to quadratic order, we can then read off the meson masses from Eqs. ( I2.33P 
and f)3.6ip . In the flavored sector, only U contributes, and we find 

m 2 Fij = B (rrii + rrij) + a 2 A(E F ) , i, j — u, d,s, i^j, (3.63) 

in which 

(3.64) 



Afe) 


= o, 

16 

~ P 


(d + 3C 3 + C 4 + 3C, 


A(z^) 


16 

: P 


(2C 3 + 2C 4 + 4C 6 ) , 


HQ 


16 

~ P 


(d + C 3 + 3C 4 + 3C ( 


A(^) 


16 

~ P 


(4C 3 + 4C 4 ) . 



As we expect, there is one exact flavor multiplet of Nambu-Goldstone bosons, because of 
U(l) e symmetry, with a mass-squared proportional to the quark masses, as in the continuum. 
All other meson masses are shifted by terms of order a 2 , and Fig. |3] is in good agreement 
with the behavior predicted by Eq. f)3.63p . The numerical results show that C 4 must be 
dominant among C*i, 3,4,6- This is consistent with the fact that, as previously mentioned, 
U (4) taste is broken at this order in ChPT to 5*0(4), not all the way down to T 4 . Thus there 
are only five nondegenerate meson masses j49|, whereas the irreducible representations of 
the staggered symmetry group would allow eight (53[. 

We end this section by quoting a one-loop result that can be obtained from our lowest- 
order staggered chiral lagrangian. But before we do this, we have to address the fact that 
the SU (12) theory has too many sea quarks, which on the lattice side is fixed by taking the 
fourth root of each staggered fermion determinant. The question is how to carry over this 
procedure to staggered ChPT. The way this can be done is through the "replica trick." One 
starts with adding more fermions to the theory by using n r instead of one staggered fermions 
for each physical flavor. For each of the n r up quarks, the quark mass is kept degenerate - 
all up quarks have mass m u , and likewise for down and strange quarks. In the lattice QCD 
path integral, this results in raising the staggered determinant for each flavor to the n r -th 
power. The idea is now to do all calculations keeping the n r dependence, and then at the 
end set n r = 1/4, which corresponds precisely to taking the fourth root of the determinants. 
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As we already mentioned, it is by no means obvious that the fourth-root procedure is 
field-theoretically legitimate. However, if it is, it is possible to show that the replica trick is 
the correct way of incorporating the effects of the fourth root in staggered ChPT (and EFTs 
for staggered QCD in general) 0, 54j. Assuming that this all works, we quote the 0(p 4 ) 



result for the two-flavor £ 5 -pion decay constant (which is the real Nambu-Goldstone boson 
for the breakdown of U(l) e symmetry), with equal light quark masses, m u = = mf. 52 

/r 1 1 £ L(2B m e + a 2 A(E B )) (3.65) 



./ 



B 



4 (L{2B Q m + a 2 5' v ) + L{2B Q m + a 2 5' A ) - 2L(2B m) 



in which 



5'v = j[ ( C 2V - C w ) , (3.66) 
16 



f 



5 A) 



and F is a linear combination of 0(p 4 ) LECs. 53 Note that now the LECs are low-energy 
constants in the Nf = 2 theory. The LECs C2v,a and C^v,a only appear in flavor-neutral 
meson masses at tree level (as can be seen by expanding out W), and their appearance 
reflects the fact that flavor-neutral mesons appear in the one-loop corrections. Flavored 
mesons also appear in the loops; producing the logarithms in the sum over B. 

We note that the masses as they appear in chiral logarithms get 0(a 2 ) corrections. This 
is of course natural, because all lattice mesons can in principle appear on loops, and most 
of these masses get 0(a 2 ) corrections, already at lowest order in ChPT. This change in the 
logarithms points at a reason as to why it is important to incorporate scaling violations 
in ChPT systematically: as long as m quar k ~ cl 2 Aq CD , one cannot expand the logarithm in 
Eq. (I3.65P in powers of a 2 . If we would have naively assumed that all scaling violations can be 
understood in terms of simple powers of a, we would have missed this nonanalytic dependence 
on the lattice spacing. The 0(a 2 ) term inside the logarithm reduces the curvature for small 
quark masses. Knowing the field-theoretical form of this nonanalytic behavior makes it thus 
feasible to explore smaller quark masses at a given lattice spacing than otherwise would be 
possible. In Fig. El I show a plot of the staggered ChPT fits of the meson masses from the 
MILC collaboration (which, of course, use the full nondegenerate 2 + 1-flavor and partially 
quenched formulas of Ref. |55|). The indices x and y denote the flavors of the valence quarks, 
which for most of the data points in this figure have masses different from the sea quarks, 
a generalization we will consider in the next section. In this figure, we see that indeed the 
curvature on the lattice is significantly different from that of the continuum curve. 

Finally, we observe that the form of the tree- level masses, Eq. (13. 63 p . is very similar to 
Eq. (13. 22 p . That raises the question whether, since we are working in the LCE regime in 



52 This result can be derived from Eq. (27) of Ref. [55j, by making the strange quark heavy (a situation 
we will discuss more in Sec. 5). For nondegenerate results, see Ref. 55], for 0(p 4 ) meson masses, sec 



Ref. 



51|. 



' 3 The complete 0(p 4 ) staggered chiral lagrangian has been worked out in Ref. [52 1 



38 




FIG. 5: m^l{m x + m y ) as a function of m x + m y . The continuous curves are (partially quenched) 
chiral fits at fixed lattice spacing; the dashed line is the (partially quenched) continuum line (with 
sea quark masses corresponding to those of the "superfine" 0.018/0.0072 ensemble), which has a 
much stronger curvature. Note how the curvature is reduced by nonvanishing lattice spacing. The 
red dashed curve is the fully unquenched continuum curve. m' s is the strange sea quark mass and 



rh! is the light seaquark mass. For more explanation of the meaning of all symbols, see Ref. la] . 
Figure courtesy C. Bernard. 



which m quark ~ cl 2 Aq CD , Aoki-like phases could occur for small enough quark mass. Using 
techniques similar to those used in QCD inequalities 54 it can be argued that all A(Hg) > 0, 
excluding this route to a possible Aoki-like phase. However, this is not true for the 0(a 2 ) 
corrections in the flavor-neutral sector, which are governed by the LECs S' VA . While no 
sign of an Aoki-like phase has been observed in numerical computations, this remains an 



interesting theoretical possibility |51|, [57 



IV. CHOOSING VALENCE AND SEA QUARKS TO BE DIFFERENT 

Consider the two-point function for a charged pion in euclidean QCD, 



(ir + (x)rr-(y)} = ((u(x)^d(x))(d(y)^u(y))) (4.1) 
1 
Z 



54 For a review, see Ref. [5' 
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= ~\ I W U n\ Det (^ + M ) tr h (P + mu)- 1 (y, x) l5 (p + m.y 1 (x, y) 

In an actual lattice computation, one usually does not use point sources and sinks; instead 
one sums with some weight over the timeslices at t = x\ and t' — but this will not affect 
the observation we want to make. 

In the third line of Eq. (14. ip . the quarks enter in two clearly distinct ways: through the 
determinant, and through the quark propagators that follow from carrying out the Wick 
contractions. We refer to quarks of the first type as sea quarks, and those of the latter 
type as valence quarks. Of course, in nature these two types of quarks are the same, and 
they have to be in order to preserve unitarity of the theory. But nothing stops us on the 
lattice from taking the quark masses in the propagators unequal to those in the determinant, 
or, even more drastically, from choosing a completely different discretization for p in the 
propagators and in the determinant! Let us first consider whether this seemingly sacrilegious 
idea might have any advantages, and if so, whether we can make any field-theoretical sense 
out of it. 

First, consider only taking valence quark masses unequal to sea quark masses. This is a 
situation commonly referred to as partial quenching. This name has its origin in the fact 
that as we send the sea quark masses to infinity, we effectively remove the determinant 
from the path integral, thus reverting to the quenched approximation. Unlike the quenched 
approximation, however, as long as we keep the sea quark masses in the theory, we can 
always recover the physical theory by choosing the valence and sea quark masses equal (for 
each flavor) [58[ . In other words, real or "full" QCD is a "special case" of partially quenched 
QCD (PQQCD). 55 

This observation should then carry over to any EFT for PQQCD, and thus, in particular, 
to partially quenched ChPT (PQChPT). (Of course, we will have to reconsider the arguments 
for the existence of ChPT, a point to which we will return below.) What we will see is that, 
if both sea and valence quark masses are light enough for the chiral expansion to apply, 
being able to vary the valence and sea quark masses independently is very useful: It gives 
us an extra tool for matching lattice QCD computations to ChPT, and thus determine the 
numerical values of the LECs at some scale — which pin down the EFT quantitatively. The 
key observation here is that those LECs are the same in the full and partially quenched 
theories, because they are independent of the quark masses, and thus also of the distinction 
that we make between valence and sea quark masses in the partially quenched generalization 
of QCD [5^|. In addition, a very practical consideration is that in most cases it is much 
less expensive in terms of computational cost to vary the valence quark masses (which only 
show up in the propagators coming from the external operators) than the sea quark masses, 
which are part of the effective gauge action used to generate an ensemble of gauge-field 
configurations. We will consider partial quenching in detail in Sees. IIVBI and IIV CI 

The motivation for choosing even the lattice Dirac operators different for valence and 
sea quarks is different. Of course, the basic assumption will be that in the continuum 
limit the difference in discretization disappears if the bare valence and sea quark masses 
are tuned to yield the same renormalized quark mass - - an extension of the notion of 



If one chooses the number of light sea quarks to be equal to three! In that case, the name "partially 
quenched" is slightly unfortunate. However, plenty of lattice computations are still really partially 
quenched, in that they include only two sea quarks. 
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universality. But, the symmetry properties of a correlation function such as Eq. (14. ip are 
determined by the symmetry properties of the valence Dirac operator, and in many cases it 
is a great advantage to be able to use a discretization with symmetries (in particular chiral 
symmetry) as close as possible to those of the continuum. Often, because of the smaller 
amount of symmetry on the lattice, operator mixing is more severe on the lattice, making 
computations of, for instance, weak matrix elements much harder, if not impossible. More 
symmetry, in particular more chiral and flavor symmetry, means less mixing. However, 
lattice Dirac operators with good symmetry properties, such as domain-wall and overlap 
operators, are numerically very expensive. Since a large part of the computational effort 
goes into evaluating the determinant, it makes sense to choose a less expensive lattice version 
for the sea quark Dirac operator, such as Wilson or staggered operators. This "mixed-action" 
option will be discussed in Sec. IIVDI 

Having extolled the possible virtues, we should address the question whether these gen- 
eralizations of QCD make any field-theoretical sense. 



A. Path integral 

Of course, operationally, the procedure of choosing different Dirac operators for the va- 
lence and sea quarks outlined in the previous section is well defined. But, does it make 
field-theoretical sense? For instance, can we hope to develop EFTs for these generalized, 
but — at this stage — suspect versions of QCD? Clearly, in order to address these questions, 
it would help if we can write down a path integral expression that reflects the distinction 



between valence and sea quarks. This can be done using a trick proposed by Morel [60 
Obviously, separate fermionic fields are needed for sea and valence quarks, but doing just 
this would lead to fermion determinants for both. The trick is to cancel the valence deter- 
minant by introducing yet another set of quark fields, with Dirac operator and mass matrix 
identical to those of the valence quarks, but with opposite (bosonic) statistics. We will refer 
to these fields as ghosts. 56 The fermionic part of the QCD lagrangian generalizes to 57 

C = q s (D s + M s )q s + q v (D v + M v )q v + q\D v + M v )q, (4.2) 

in which q denotes the bosonic ghost-quark fields; D s is the lattice Dirac operator for sea 
quarks, and D v that for valence quarks. In general, we will label sea quantities with subscript 
s, valence quantities with subscript v, and put tildes on entities referring to ghosts. Note 
that the ghost and antighost fields, q and q\ are not independent. The path integral for 
this theory is 

Z = J I[[dU»] YlidqJ [dq s>i ] Y[[dq Vlj ] [dq v , 3 ] [da]} [dq ] (4.3) 

V i j 

exp ( -S gauge - Y$e(P* + M Ms + q v (D v + M v )q v + q ] {D v + M v )q) j 



56 These ghosts have nothing to do with Faddeev-Popov ghosts! They are called ghosts, because they share 
the similarly "incorrect" spin-statistics properties. 

57 I write the Dirac operators D s and D v without a slash, because not all lattice Dirac operators are of the 
form p = 
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= / H[dUp] exp{-S gauge ) Det {D s + M s ) , 

which follows because of the exact cancellation of the valence and ghost determinants. Intu- 
itively, at the level of diagrams, valence and ghost loops cancel, because they always come 
in pairs with opposite signs between them. Clearly, it is possible to couple the path integral 
to sources for all quark fields, and we can thus generate correlation functions for operators 
made out of any of these fields. In numerical computations, of course the ghost fields are 
never introduced in the first place. But one still has access to correlation functions that 
include ghost quarks, because after doing the Wick contractions one simply finds that each 
ghost- ant ighost contraction is replaced by a valence propagator. For example, if one consid- 
ers the pion two-point function of Eq. (14. ip for pions made entirely out of ghost quarks, one 
finds exactly the same expression as on the last line of Eq. (14. ip . except without the minus 
sign, which would be absent because ghost fields commute, rather than anticommute. 

In order for definition ( 14. 3 p to make sense, we have to make sure that the integral over 
ghost fields converges. This is the case if D v is antihermitian, as it is in the continuum and 
for staggered fermions, for instance. In addition, we have to require that the mass matrix, 
M v , has strictly positive eigenvalues. For Wilson or domain- wall fermions the Dirac operator 
is not antihermitian (or hermitian), and the real part of its eigenvalues can be of both signs. 
For a method of defining PQ or mixed-action QCD with valence Wilson fermions, I refer to 
Ref. [6l[. 58 For overlap fermions, the eigenvalues are also complex, but one can prove that 
if the overlap operator satisfies = 75D75, all eigenvalues have a nonnegative real part. 
Thus, the ghost integral converges again as long as M v has only positive eigenvalues. 

It is clear that, while a euclidean path- integral definition can be given, this theory is sick. 
The ghost quarks have the wrong statistics for a spin- 1/2 field. Remember that if you try to 
quantize a spin-1/2 field with the wrong statistics, one of the problems that occurs is that 
the hamiltonian of the theory is unbounded from below j62[! This and other [63] aspects 
indicate that it may be problematic to continue the theory to a theory in Minkowski space 
with a positive hamiltonian. But even if we could continue to Minkowski space, there is 
a problem with unitarity if one considers correlation functions with valence quarks on the 
external lines, while different sea quarks run on the loops. 59 

Fortunately, lattice QCD, as applied in modern numerical computations, does not want 
to be in Minkowski space. It is perfectly fine to match euclidean lattice computations to a 
euclidean EFT. Once the EFT has been completely defined by obtaining the values of all its 
LECs, we can take the continuum limit, set valence quark masses equal to sea quark masses, 
and continue to Minkowski space. The fact that PQQCD and mixed-action QCD can only 
be defined in euclidean space is not a problem. 

However, for all this to work, we do need to develop ChPT for partially quenched and 
mixed-action theories, and it is not obvious that this can be done. The generalized theory 
does not satisfy some of the cherished properties of a healthy quantum field theory, in 
particular unitarity, 60 which are usually invoked to argue that a local, unitary EFT such as 
ChPT has to exist, and, is in fact the most general possibility [Tof. We will say more about 
this in Sec. IVB]for PQQCD, and in Sec. HVDl for mixed-action QCD. 



A similar method as described in Ref. 

61] 

exists for domain- wall fermions. 
It is clear that this sickness already occurs in weak-coupling perturbation theory. 

And causality, if these generalized theories only live in euclidean space. However, since PQQCD and mixed- 
action QCD are local theories, we expect that the cluster property is realized on correlation functions. 
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Before we delve into the specifics, let us discuss some generalities considering the symme- 
tries of Eq. (14. 2p . If we have N s sea quarks (we will always take N s = 3 or 2, as in previous 
sections) and N v valence quarks (we can always pick as many valence quarks as we need, 
so N v is arbitrary), it looks like the chiral symmetry group of Eq. (14.21) (for M s = and 
M v = 0) is [U(N S ) L x U(N S ) R ] x [U(N V \N V ) L x U(N V \N V ) R }. 61 Here U(N\N) is a "graded" 
group, which we will discuss in more detail in the next section; 62 for now, all we need is 
that the symmetry group of the ghost part of the lagrangian Eq. (14 .2p alone would be the 



subgroup U(N v )l x U(N v ) r . It turns out that this is not quite correct (6l|, |64], |65| , as I will 
now explain. 

Since q 1 is not independent of q, it follows that the ghost part of Eq. (14. 2p is invariant 
under 

q L -> Vq L , q R -> V t_1 g fl , (4.4) 

with V G GL(N V ). Here 

1 1 
Ql = -(1 - 7s)g , qR : = -(1 + 7s)g , (4.5) 

from which it follows that (unlike for quark fields q and q, for which the chiral projections 
can be chosen independently, cf. Eq. (12. 2p ) 

?1 = ^(1-Tb), fe = ^(l + T 5 )- (4-6) 

Note that the definition of the projected g^'s is opposite to that of the projected <fs. Equa- 
tion (I4.4p then follows from the fact that 

$D v q = q ] L D v q R + q R D v q L . (4.7) 

Here I assumed that {.0^,75} = 0, which holds in the continuum. The lattice equivalent 
of the decomposition (14. 7p will depend on the type of lattice fermion employed, as already 
noted above. 63 But in all cases, the fact that q^ is not independent of q determines the 
precise form of the chiral symmetry group, and in all cases it has to coincide with GL(N V ) 
in the continuum limit. 

The vector subgroup, defined as the group which leaves the ghost condensate qLQL + Q R QR 
invariant, is the unitary subgroup, U(N V ) C GL(N V ). If we restrict ourselves momentarily to 
the ghost sector only, this implies that S has to be an element of the coset GL(N V )/U(N V ). 
If we want to write E in terms of a meson field analogous to in Eq. ( I2.6p . but with 
its meson component fields "made out of ghost quarks and antiquarks," we should write 
S = exp(20//), with <fi a hermitian matrix. The number of mesonic degrees of freedom is 
the same as it would be for the coset [U(N V ) L x U(N V ) R ]/U(N) V , but the expression for E 
in terms of the meson fields is different. 64 



In the partially quenched case, for which D s = D v in Eq. (14.2[) . the symmetry group is larger, as we will 

discuss in Sec. IIVB1 

See the text starting around Eq. (|4.10l) . 

For instance, for staggered fermions, one can define qu and using the generator e(x) of U(l) e transfor- 
mations instead of 75. 

I am skipping a number of further technical details about the precise symmetry group of Eq. (|4.2[) . Also, 
when all is said and done, the anomaly will have to be taken into account, as usual. For the latter, see 
Sec. UVBl 
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For nonperturbative issues, it is crucial to work with the correct symmetry group. This 
can already be seen from Eq. (14. 2\i : If we introduce spurion fields M and M to write the 
ghost mass term as 

q*M v q ->• q[Mq L + q R Mq R , (4.8) 

transforming as 

M -> V^MV- 1 , M -> VMV ] , (4.9) 

we see that both M and M remain strictly positive under GL(N V ) transformations, consis- 
tent with the convergence of the ghost path integral. This would not be true if we replace 
GL(N V ) by U(N v )l x U(N v ) r . 

One may want to use ChPT to find the phase structure of a quenched or partially 
quenched theory, as we did for Nf = 2 Wilson fermions in Sec. IIII CI [61]. In that case, 
one will have to be careful to start from the correct symmetries in constructing the chiral 
lagrangian. However, if one is in a phase in which the correct vacuum has been determined 
(and usually in applications, is given by E = 1), and one is only interested in calculating the 
chiral expansion of physical quantities, it can be shown that instead the symmetry group 



U(N V \N V ) L x U(N V \N V ) R can be used for constructing the chiral lagrangian [61l. |66|. This 
is what we will do in the next section. The reason is that in perturbation theory, which is 
defined by expanding E in terms of 0, we can perform a field redefinition — > i<f> that brings 
E into the "standard form," E = exp(2i(f) / f) , i.e. , the form that would have followed from 
starting with the ghost-sector symmetry group U(N v )l X U(N V ) R . 

As an aside for the interested reader, there is another way to define PQQCD and mixed- 



action QCD, through the replica trick [67|. Instead of introducing ghost quarks, one only 
introduces separate valence quarks, for each flavor. One then takes the limit rij — > at 
the end of a calculation. 65 Formally, this sends the valence determinant to a constant, thus 
removing it from the QCD partition function. For all positive integer values of the rij, this 
defines a physically sensible theory, in which we just happen to discretize different quarks in 
different ways. The disadvantage is that not much is known about the properties of the limit 
ni — > 0. In this case one bases the construction of the chiral lagrangian on the symmetry 
group [U(N S ) L x U(N S ) R ] x [U(N V ) L x U(N V ) R ] (with N v = Ei™i), and considers ChPT in 
the rii — > limit. In all cases where explicit calculations have been done, the results agree 
from those obtained with Morel's trick, using ghost quarks. 



B. Partially quenched ChPT 

In this section, we will consider ChPT for PQQCD, the generalization of QCD in which 
we only choose the masses of valence and sea quarks to be independent, while using the 
same discretization of the Dirac operator. The first question is whether indeed EFTs such 
as ChPT can be constructed for PQQCD. We will assume that it can, as a euclidean EFT for 
the underlying euclidean theory. There exists no proof of this assertion. Some observations 
in support can be made, however. 



The idea is similar to the usage of the replica trick in defining the fourth root with staggered fermions, 
hence the same name. Note however, that the aim is different: here we take rii — > 0, rather than continuing 
it to 1/4. 
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1. One expects the SET to exist, with almost the same level of rigor as in the physical case. 
This expectation is based on two observations: First, I expect that the perturbative 
construction of the SET goes through just the same for the partially quenched case. 
Then, the fact that the SET also is valid beyond perturbation theory amounts basically 
to the assumption that the coefficients in the Symanzik expansion have a well-defined 
meaning beyond perturbation theory, 66 and it does not look like a drastic step to expect 
that this assumption is also valid in the partially quenched case. While this observation 
does not directly address the validity of partially quenched ChPT (PQChPT), the 
SET is a "stepping stone," and its validity is important for the construction of the 
appropriate chiral theory. 

2. PQQCD contains full QCD as a subset: setting a valence quark mass equal to a sea 
quark mass makes that valence quark the same as a sea quark, and correlation functions 
with only sea quarks on the external lines are exactly those of full QCD. The collection 
of all correlation functions of PQQCD thus contains all correlation functions of full 
QCD as a subset. This allows us to make two observations: First, chiral symmetry 
breaking in the sea sector (i.e. , through a sea-quark condensate) takes place as usual. 
In PQQCD, one can use vectorlike symmetries to rotate sea quarks into valence quarks, 
thus producing also a valence condensate (and, by extension, a ghost condensate equal 
to the valence condensate). 67 Second, for full QCD the SET and ChPT exist, and it 
seems not unreasonable to assume that the match between PQQCD and PQChPT 
can be extended away from m v = m s , at least as long as the difference m v — m s does 
not become too large, relative to m s and m v . Below, we will have more to say about 
what "too large" means. 

3. The assumption that EFTs such as SET and ChPT exist for purely euclidean theories 
appears rather natural if one would imagine constructing the EFTs through the renor- 
malization group. In this framework one would expect the EFT to be local below the 
scale above which one has integrated out all modes through some type of renormaliza- 
tion group blocking procedure. To be sure, there is no proof that the EFT obtained if 
one actually knew how to carry out the renormalization group blocking is local. But 
it is a "folklore" that seems equally applicable to PQQCD as to the standard case of 



Based on these observations, we will assume that, as before, the lagrangian of the EFT 
of interest should be taken to be the most general local function of the fields consistent 
with the symmetries of the theory. In fact, it is interesting to follow this route, and see 
where it will lead us. Hopefully, PQChPT will exhibit, in a more concrete form, what the 
diseases of the theory are, and, hopefully, these will be the only diseases! If this is true, 
this makes it possible to use PQChPT in the interpretation of lattice results (read: use 
the predictions of PQChPT to fit data). If this works to a high degree of precision, that 
constitutes a nontrivial and important test of PQChPT, and thus of our understanding of 
PQQCD. As we will see, the tests become nontrivial once numerical computations "can 



An assumption that is used in nonpcrturbativc improvement. 

The argument that vector symmetries are not spontaneously broken [9( extends to continuum PQQCD. 
However, not all discretizations obey the conditions for this theorem, as the possible existence of the Aoki 
phase in QCD with Wilson fermions shows 



full QCD. 
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see" the nonanalytic terms predicted by PQChPT. Numerical evidence for such nonanalytic 
behavior exists, and I expect that it will fairly rapidly become more extensive, at least in 
the Nambu-Goldstone-boson sector. 

PQQCD has a much larger symmetry than Eq. (14.21) . because now sea, valence, and ghost 
quarks can all be rotated into each other. (Only the mass matrices cause a soft breaking 
of this symmetry.) Following our earlier discussion, we will now think of as independent 
of q, just as q is independent of q. The full chiral symmetry group (ignoring the mass 
matrix) is then SU(N S + N v \N v )l x SU(N s + N v \N v )r, where we now omit an anomalous 
axial U(l), as well as the U(l) for quark number, which is trivially represented on mesons. 
The nonlinear field E = exp(2i$//) now lives in the coset [SU(N S + N V \N V ) L x SU(N S + 
N V \N V ) R ]/SU(N S + N V \N V ) V , and correspondingly, the field $ describes (N s + 2N v f - 1 
mesons, counting precisely the number of pseudoscalar meson fields one can construct from 
the sea, valence, and ghost quarks and antiquarks when we leave out the singlet field that 
corresponds to the axial U(l) (see below). 



Before we continue, let us have a brief look at the graded group SU(N S + N V \N V ) [68 
An element U G SU(N S + N V \N V ) can be written in block form as 

u =(c l)< (410) 

in which A is an (N s + N v ) x (N s + N v ) matrix of commuting numbers, DaniV„x N v matrix 
of commuting numbers, while B and C are (N s + N v ) x N v and N v x (N s + N v ) matrices 
of anticommuting numbers, respectively. It is straightforward to check that if E has this 
structure, also $ has to have this structure. The quark fields 



Q= Qv I , Q = (q s q v q ] ) (4.11) 

transform in the fundamental and antifundamental representations of SU(N S + N v \N v ), es 
and in terms of these "super" quark fields, the PQQCD lagrangian takes the simple form 

CPQQCD = Cgauge + Q{D + M)Q , (4.12) 

in which Ai is the mass matrix 

M= M v I . (4.13) 





In the chiral theory, invariants are constructed using traces and determinants, and we 
need the generalization of these to graded groups. The "supertrace" of a matrix U as in 
Eq. (14.101) is defined as 

str(C7) = \x{A) -tr(D) . (4.14) 



Not all products of irreducible representations of graded groups are fully reducible. This plays a (minor 



role for weak matrix elements in the partially-quenched theory with N s = 2, see Ref. 69], and references 
therein. 
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The minus sign maintains the cyclic property, strf^iL^) = strf^tA)- The "superdetermi- 
nant" is then defined through 



sdet(tf) = exp(strlog(C/)) = det(A - BD^C) /det(D) , (4.15) 

for U of the form Eq. (14.1 Oft : this definition implies that sdet(UiU2) = sdet([/i)sdet(t/2). To 
understand this expression, first decompose U as 

TT (A B\ (I BD" 1 \ (A — BD~ X C 0\f 1 0\ . . 

u= {c d) = {o i ){ o d)\d^c i) ■ (4 - 16) 

Now, using the definition sdet(C7) = exp(str log(Z7)), the result follows, with the superdeter- 
minant of the first and last factors on the right-hand side being equal to one. 

Hermitian conjugation is defined as usual, with the proviso that complex conjugation 
reorders: (ab)* = b* a*, 69 from which it follows that (UiU 2 Y = U\u\. The group SU(N S + 
N V \N V ) is now defined as the group of all unitary graded (N s + 2N V ) x (N s + 2N V ) matrices, 
with grading as in Eq. (I4.10p . and superdeterminant equal to one. Clearly, if £ = exp(2z$//) 
has sdet(S) = 1, this is equivalent with str($) = 0. 

The matrix $ can be written in block form 

• = < 4 - 17 > 

in which contains all meson fields made out of fermionic quarks, contains all fields made 
out of ghost quarks, rj contains those made out of a fermionic quark and a ghost antiquark, 
whereas rj contains meson fields made out of a ghost quark and a fermionic antiquark. 
Note that <fi is not the same as Eq. (12. 6p . because it also contains meson fields made out 
of valence and sea quarks ("mixed" pions) and meson fields made out of valence quarks 
only ("valence" pions). The upper lefthand N s x N s block inside can be identified with 
Eq. (jtSD, for N s = 3. 

In practical applications, it turns out to be useful to relax the restriction that str($) = 0, 
which makes S 6 U(N S + N V \N V ) instead of SU(N S + N V \N V ). In conjunction, it is also 
useful to parametrize the diagonal of $ not in terms of physical fields (i.e. , those found by 
diagonalizing the mass matrix), but in terms of "single-flavor" fields: 

diag($) = (U, D, S, X, Y, . . . , X, Y , . . .) . (4.18) 

Here we have introduced a now commonly used notation for the valence quarks x,y,..., 
with U ~ mm, X ~ xx, etc., instead of u v , d v , . . ., to save writing indices. One can have as 
many valence quarks as one needs — since they do not contribute to the dynamics, their 
number need not be fixed. This explains the dots in Eq. (I4.18P ; of course, the number of 
ghost quarks needs to be equal to the number of valence quarks. 70 The "super" -singlet field 
(the "super-?/"), 

$ = str($) = U + D + S + X + Y + ...-(X + Y + ...), (4.19) 



Of course, this only makes a difference when both a and b are Grassmann variables. 
Within these lectures, I'll need only two valence quarks, x and y. But for instance for nonleptonic kaon 
decays, one needs three. 
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is not a Goldstone meson, because of the axial anomaly. This can easily be seen from the 
contribution of all quarks to the triangle diagram: in order for the ghost-quark contributions 
to add to the anomaly, one should include their contributions with an explicit minus sign, 
because they do not get a sign from the loop. This means that we should really remove the 
field $o from the chiral lagrangian (as we have been doing thus far). We will do this by 
giving it a large mass, that we will eventually send to infinity, thus decoupling the super- rf . 71 
It is time to get to the partially quenched chiral lagrangian! The lowest order form is 

= l -f str^E^E) - l -f str( X t£ + tf X ) + \m 2 Q (str($)) 2 , (4.20) 

in which x is a spurion for the quark masses, that should be set equal to 2B A4. We note that 
the field <3>o = str(<£>) is not constrained by symmetry (except parity), and thus we should 
really multiply every term in Eq. (14.201) by an arbitrary (even) function of $o- 72 However, 
we will be decoupling the super- rj', and that removes the dependence on all parameters 
contained in these potentials [66] . 

From this lagrangian, we can read off the lowest-order expression for all meson masses in 
the theory. This is straightforward for the off-diagonal fields in $. By expanding Eq. ( I4.20p 
to quadratic order, one find that the mass of a meson with flavors i and j is given by 



ni 



.2 



B (mi + rrij) . (4.21) 



The only thing "different" is that the ghost-meson propagators get an extra minus sign 
because of the supertrace in Eq. (I4.20p . 

(fe(p)fei(?)> = -TZ~^ 5 (P ~ ?) • ( 4 - 22 ) 

This is a first example of how PQChPT makes the "diseases" of PQQCD visible. For 
fermionic mesons, one has to remember that the ordering of r) and rj matters. The fermionic 
nature of these fields will provide minus signs when calculating loops, thus providing a 
mechanism through which the cancellation between valence quark and ghost quark loops is 
built into PQChPT. 

The flavor-neutral sector is a little more complicated, because of the m 2 , term (or, equiv- 
alently, because of mixing between the U, D, etc. fields). Let us calculate the propagator 
From the first two terms in Eq. ( 14.20)) . there is a contribution 

Gii - <*«%> = ~^-T , (4-23) 
V ~r "Hi 

J +1 , i sea or valence 
e% ~ y — 1 , i ghost 

from the first two terms in Eq. f)4.20p . with a minus sign as in Eq. f!4.22p if i refers to a 
ghost. There are additional contributions coming from the m 2 , term, that can be found by 
treating this term as a two-point vertex, and doing the geometric sums. In detail, we can 
rewrite 

im 2 (str($)) 2 = im 2 «!'„ , (4.24) 



71 For a proof that this is a mrrer. t pr 

72 And add more terms, see Refs. [12|, 



sdure, see Ref. 
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with 




(4.25) 



One finds, using that 



(4.26) 



A: 



for the contribution proportional to ml 58|, |59j: 



-m 2 /3 (GKG)ij + (-ml/3) (GKGKG)^ + ... 



(4.27) 



-ml/3 [ 1 




(p 2 + m 2 i )(p 2 + m 2 j) (p 2 + m 2 )(p 2 + m 2 )(p 2 + m 2 ,) 
— 1/3 (p 2 + mlj)(p 2 + m 2 D )(p 2 + m 2 s ) 



(p 2 + m| ) (p 2 + mjj ) (p 2 + m\ Q ) (p 2 + m 2 ) 



Let us step through this equation line by line. In the first expression on the right-hand side, 
the sum in the denominator really extends over all flavors, but the ghost terms cancel the 
valence terms, reducing the sum to run over the sea flavors u, d and s only. The second 
expression is just a rewriting of the first line. It is clear that the denominator of the second 
factor is a third-order polynomial in p 2 , and that it has to have poles at the physical masses 
(this factor only refers to the sea sector). These are of course the masses of the 7r°, tj and 
rf . The rj' mass-squared contains a term proportional to m§, 



whereas the 7r and r\ become independent of wig when we take it to infinity. This observation 
explains the last line in Eq. (I4.27p . In Eq. ( I4.28P I made the dependence on the number of 
sea quarks, N s , explicit — of course, in our explicit example above N s = 3. 

This result for the neutral propagators exhibits a new property of the partially quenched 
theory. The easiest way to see this is to note that, if we take i = j to be a valence flavor, 
there is a double pole in Eq. (I4.27p . This is a clear sickness of the partially quenched theory, 
reflecting the fact that it does not correspond to a standard field theory in Minkowski space. 
We should expect, however, that if we take both i and j to correspond to sea flavors, that 
everything works as in the full theory: by design the sea sector does not know about valence 
and ghost quarks! Indeed, if we pick both i and j one of u, d, s, we see that at least one of 
these poles cancel against a factor in the numerator, leaving us with an expression that can 
be written as a sum over single poles. (This also works if there is a degeneracy in the sea 
sector.) It is a nice exercise to work out the propagators for the sea n° and r/, and find that 
indeed they agree with the expressions one finds in the full theory. A very simple example 
is the case of degenerate sea masses, my = mo = m$ = m sea , for which we obtain for the 
neutral sea propagator the expression 




— - ml + terms proportional to sea quark masses , 



(4.28) 




(4.29) 
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where the "1" term comes from the first two terms in Eq. (I4.20p . and the "1/3" term comes 
from Eq. (I4.27p . and projects out the r( ' . 

So, while the sea sector is healthy, as it should, the neutral valence sector is sick. So 
what? Maybe this just tells us that we should not consider flavor-neutral valence pions. 73 
However, even if we make this choice, the sickness permeates to the flavored sector as well, 
as I will now argue. 

We should be able to calculate chiral logarithms in meson masses, decay constants, etc., 
from Eq. (14.201) as usual. The four-point vertices following from C^ 2 ' have a flavor struc- 
ture &ij§jk&ki&u, dressed up with derivatives or quark mass factors. Consider a one-loop 
contribution to (QijQji), with i ^ j so as to stay away from the dangerous neutral sector. 
Because we only have four-point vertices, all one-loop diagrams are tadpoles, with and 
$ji connected to the same vertex. One of the Wick contractions is the one in which the 
index structure on the vertex is <&ij$>jj$>ji<$>n, where $jj and $ J5 have to contract and form 
the loop! We see that the neutral propagator terms of Eq. (I4.27P unavoidably show up, if we 
want to make use of partial quenching at all. For simplicity, let us work out the degenerate 
case mu = m D = m s = m sea , ma = rrijj = m va [. Simplifying Eq. (I4.27P accordingly, and 
integrating over p, as one would in the one-loop diagram, we obtain 

1 f d 4 p p 2 + m 2 sea 1 d 



3 J (2vr)4 (p 2 + m 2 val ) 2 3 dm 2 



val 




(4.30) 



9 

mi 



The arrow in the second line indicates that I only kept the chiral logarithm. This chiral 
logarithm is not of the structure we encounter in full QCD, where no infrared divergences 
can occur. Here there is an infrared divergence: Eq. f)4.30p diverges when we take the valence 
mass to zero at fixed sea mass. Only when we take these masses in a fixed ratio can we define 
the limit. This means that the chiral expansion in the partially quenched theory will not 
converge if we take the sea and valence masses too different. 74 We should not only keep m 2 al 
and m 2 ea small in the sense of the chiral expansion, but we should also keep the unphysical 
infrared scale m 2 ea — m 2 al small compared to either of those, \m 2 ea — m 2 al \ ^\mn(m 2 sea) m 2 al ). 
For current numerical simulations, these conditions are well satisfied, so that the infrared 
divergences of the partially quenched theory do not constitute a problem in practice. 

There are many examples of the unphysical infrared behavior of the partially quenched 
theory — the phenomenon is generic. Here I briefly mention two of them; for detailed 
explanations, see the original papers quoted below. 



The first example occurs in scalar-isoscalar and scalar- isovector propagators [72J. In that 
case, there are one-loop contributions with tt — tt or tt — rj intermediate states. In the isopin 
limit, Eq. ( I4.27P contributes to the i] internal line. With the additional pion internal line, this 
leads to an integral like Eq. (I4.30p . but with three powers of p 2 + rn 2 al in the denominator. 75 
This leads to an infrared divergence (at fixed m 2 ea ) that goes like {m 2 ea — m 2 ^) /m 2 ah worse 
than the logarithmic divergence of Eq. f !4.30p . 



73 They are numerically hard, because of disconnected diagrams, anyway. 

74 For a heuristic argument that such infrared divergences are a property of QCD, and not an artifact of 



ChPT, in the quenched case, see Ref. [71 j. 
75 Using the quark-flow picture of the next section, it is straightforward to see that all propagators in this 
contribution are valence propagators. 
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A second example where the double pole has a dramatic effect is in the contribution 
from one-pion exchange to the nucleon-nucleon potential. In full QCD one-pion exchange 
leads to a Yukawa potential, exp(— m va ir)/r, whereas double-pole terms in the exchange 
lead, instead, to a potential of the form ijn 2 sea — m 2 , nl )exp(—m v n ir) /m va i, as can be seen by 
differentiating with respect to m 2 al , like we did in Eq. (I4.3(jp [731 ] . The double pole thus 
leads to an unphysical interaction that dominates the physical term at large distance. 

In all cases, the unphysical infrared effects are, of course, proportional to m 2 sea — m 2 al . 
In general, the message is that partial quenching can be a useful tool, but in order to make 
use of PQChPT, we do not only need m 2 sea and m 2 al to be small enough, but also their 
difference, m 2 sea -m 2 val . 

Having done all the preparatory work, let us consider the O(yfi) result for the valence 
pion mass, in the case of degenerate sea quarks, and for m x = m y [741]: 76 

m x = 2B m x < 1 + 2 ^ 2 p [ym x - m sea ) log [—^ — J + m x - m Sl 

ULn ((2L 8 - L 5 )m x + 3(2L 6 - L,)m sea )\ . (4.31) 



P 

The chiral logarithms in this expression precisely come from the "double-pole" part of the 
neutral propagator, Eq. (I4.27p . i.e. , they are of the type Eq. (14.301) . 77 

This result, finally, allows us to return to the point as to why partial quenching is useful. 
As we have seen, full QCD is contained in PQQCD — the only difference is that we choose 
different masses for the valence and the sea quarks. Since the LECs are, by construction, 
independent of the quark masses, the LECs in PQChPT and full ChPT are identical. In 
other words, partially quenched lattice computations give us access to the real- world values 
of the LECs of ChPT [59| ; the fact that we can vary the valence quark masses independently 
simply gives us another "knob to turn." This can be very useful: it is clear that by varying 
the valence quark masses independently, one can extract 2L 8 — L 5 from the pion mass, 
without varying the sea quark masses. This stands in contrast to the full theory: as can 
be seen from Eq. ( I2.40p . in order to separate 2L 8 — L 5 from 2L 6 — L 4 , one would have to 
vary the sea quark mass. It is important to note that the LECs do depend on the number 
of "dynamical" flavors, N s — one still has to do the numerical lattice computation with the 
correct number of sea quarks. 

While it is true that the values of the LECs in PQChPT and full ChPT are the same, 
there can be more LECs in the partially quenched case. This already shows up at order 
p A : the operator ti(L^ t L u L fl L v ) is not independent from those in Eq. ( I2.17P for SU(3) cf. 
Sec. IIIB[ but str(L^L u L^L u ) is for SU(3 + N V \N V ) [76|. 78 One thus obtains the most general 
0(p A ) lagrangian for PQChPT by replacing traces with supertraces in Eq. f)2.39p . and by 



76 See also Ref. 



75] , which also discusses the case in which Xsea is not very small compared to mp ; this can 
lead to a reduction of the coefficients of chiral logarithms. 

77 All contributions to the self energy with mesons containing sea quarks on the loop get absorbed by the 
wavefunction renormalization. 

78 For another example relevant for weak matrix elements, see Ref. [771 ] . For a discussion of the implications 
of graded group representation theory for "external operators" such as electroweak operators, see Ref. [s^]. 
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adding 



J PQ 



str(L M L 



+ 2 str(L^L IM L u L u ) + 



(str(L M L /J )) - str(L M L t/ ) str(L At L 1 , 



(4.32) 

where the new operator is written in this way because this combination vanishes for SU(3). 
This new operator does not contribute at tree level, because at tree level there is no dis- 
tinction with full QCD, for which SU(3) is the relevant symmetry group. It can contribute 



76 , where, of 



to one-loop diagrams, and does so in the case of pion scattering, for instance 
course, this only happens if valence masses are chosen different from sea masses. 

Finally, we remark that quenched QCD, and thus quenched ChPT corresponds to the 
special case that N s = [70|, |78j. While we are not pursuing quenched QCD in these 
lectures, it is worth noting that this special case has, in fact, some special properties. For 
example, one cannot take the limit — » oo, as inspection of Eq. (I4.27P reveals (the factor 
in parentheses on the first line gets replaced by one). Of course, quenched QCD does not 
have the correct number of light sea quarks, and is thus not only sick, but terminally ill. No 
physical result with controlled errors can ever be obtained from the quenched approximation. 



C. Quark flow 

In the previous section, we have seen that PQChPT can be systematically developed for 
PQQCD. Once we assume that this EFT exists just as in the full-QCD case, the construction 
is quite straightforward. But, one may like to have a more pictorial understanding how it 
all works, and this is provided by the quark-flow picture (which, in a sense, is formalized 
through the replica trick, but here I will not attempt to present it that way). 

We have two types of quarks: valence and sea quarks. Valence quarks are represented 
by propagators, as in Eq. (14. ip . and in a diagrammatic language, can be represented by 
valence quark lines. Each valence quark line has to have a beginning and an end at some 
external source or sink. Valence quark loops can be formed for instance if more than one 
valence line connects the same source and sink, as in the example of Eq. (14. Xp . The "valence 
quark part" of a contribution to a correlation function can thus be precisely defined, and the 
topology of valence lines represents the various possible Wick contractions which contribute 
to a particular correlation function. 

In perturbation theory, the (logarithm of the) fermion determinant, which comes only 
from sea quarks, can be represented by an infinite sum over one-loop diagrams with any 
number of gluon lines attached to the sea-quark loop. One thus imagines the diagram 
coming from a certain Wick contraction in the valence sector to be dressed up with gluons 
and sea quark loops in all possible ways. In this way one can also represent sea quark 
contributions through loops, but clearly this picture is not rigorous outside perturbation 
theory. In this picture, however, the reason that only internal (i.e. , not connected to a 
source or sink) sea quark loops occur is because valence and ghost loops, which in principle 
also are present, 79 cancel each other. It is instructive to consider PQChPT in terms of quark 
loops. 

Figure |6] shows quark-flow representations of the meson propagators at tree level, as well 
as the four-point meson vertex. The flavored propagators that follow from Eq. (14. 20[) are 



For instance, they would be present if one takes the ghost masses different from the valence masses. 
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FIG. 6: Quark flow diagrams for ChPT Feynman rules, (a): flavored propagator, (6): TOq part of 
the flavor-neutral propagator, cf. Eq. (|4.27j) ■ (c): single-trace four-point vertex, (d): double-trace 
four-point vertex. Arrows on quark lines are shown in (a), and understood in the other figures. 
See text. 
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FIG. 8: ChPT diagrams for the pion one-loop self energy. A dashed line stands for one of the 
meson fields in The cross stand for an insertion of Eq. (|4.27|) . 



represented in "double-line" notation, with each line denoting a quark of a certain flavor, 
Fig. [Bh.. Note that we only draw valence lines — one is to imagine these diagrams arbitrarily 
dressed up with gluons and sea quark loops. In the flavor-neutral sector, we have seen 
that there are additional contributions, which follow from iterating the two-point vertices 
proportional to rrip. Flavor-neutral mesons get a contribution as in Fig. [6b, (just set i = j), 
but also get a contribution from Fig. [6b, where the "double hairpin" represents the 
two-point vertices. This is what we usually call a "disconnected" diagram in lattice QCD, 
because it represents a disconnected valence Wick contraction, but again, it is not really 
disconnected in terms of gluons. Note that the double hairpin stands for the full geometric 
sum (14.271) of all insertions of the ml vertices, where each additional insertion of a double- 
hairpin vertex corresponds to another sea-quark loop inserted between the two hairpins. 

Figure [6b represents a vertex with flavor structure $jj$jfc<I>fcn<I>ni, which follows from the 
single-trace terms in £( 2 \ whereas Fig. [6]i represents the flavor structure $jj$jj$fc n $ n fc, 
which is a double-trace term. The latter do not occur in C^ 2 \ but they do occur in 
Various one-loop contributions to a pion propagator are shown in Fig. Each internal 
quark loop has to be summed over only the sea quarks, because valence and ghost internal 
loops cancel each other. It so happens that only diagrams of type [7b contribute at order p 4 
to meson masses; more types contribute to decay constants. 

From these examples, it is clear that there is no one-to-one correspondence between 
PQChPT diagrams and quark-flow diagrams. The PQChPT diagram representing all dia- 
grams of Fig. [TJis just the tadpole shown in Fig. [Bb. One can refine this a little by representing 
the hairpin vertex by a cross, as in Fig. [8b. Of course, one can also use the quark-flow picture 
for correlation functions in full ChPT, and one can use those as a starting point to develop 
PQChPT intuitively, as was originally done for quenched ChPT [78]. It can be very helpful 
in seeing how ChPT should work for some "modified" version of QCD. For an example of 
applying this to staggered ChPT for staggered QCD with the fourth-root procedure, see 
Ref. [5l|. However, it is important to have a path-integral representation for each modified 
version of QCD, 80 as this gives us the opportunity to use many of the standard tools of field 
theory. Examples of these tools are the precise definition and role of the symmetries of the 



This is the problem with staggered QCD with the fourth-root procedure, see Ref. [4) 
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modified theory, and the availability of field redefinitions |58|, [70 . 

Double-hairpin diagrams as in Fig. [6b contribute only to flavor-neutral propagators (at 
tree level). The quarks on each of the hairpin lines are valence quarks, and the quark- 
antiquark pair that each of these hairpins represents gets bound into a valence meson — we 
thus expect each of these hairpins to represent a single meson pole with the mass of a valence 
pion. Any other quark loops in this diagram are not explicitly shown, and can only come 
from the fermion determinant. In other words, any such quark- loop contributions can only 
involve the sea quarks, and, if sea quark masses do not equal valence quark masses, nothing 
in the diagram can remove the two valence-pion poles. We conclude, at this heuristic level, 
that double pole terms have to occur in neutral pion propagators in PQQCD, and this is, 
of course, confirmed in PQChPT. 81 Because double poles cannot occur in full QCD, we also 
conclude that the residue of these double poles has to be proportional to m sea — m val . 

Another observation that is easily made in the quark flow picture is how PQQCD gives 
access to Wick contractions not available in full QCD. Suppose we do take all valence masses 
equal to the sea masses, so that the theory is not really partially quenched. The only effect 
of partial quenching is that we have more flavors available to put on the external lines 
than run around on the internal loops which come from the fermion determinant. We can 
use this, for example, to separate the disconnected part, Fig. [6b, from the connected part, 
Fig. Ek, of any flavor-neutral meson propagator. In other words, partial quenching separates 
the contribution of different Wick contractions to the same correlation function. At the 
level of quark flow diagrams, this is obvious. But since all that happens at the quark flow 
level is encoded in PQChPT, we can use this to obtain systematic chiral expansions for 
these separate Wick contractions. A simple example follows from Eqs. (14.231) and (14.271) . 
In QCD with one flavor, these two contributions cannot be separated. But, one can add 
valence quarks to the theory (keeping the number of sea quarks at just one), and use this 
to separate the connected and disconnected parts (14.231) and (I4.27P even in QCD with one 
flavor. This can for instance be used to define what it means to set the quark mass to zero, 
by requiring that the pion mass obtained from the connected part vanishes [79| . 

Another example of this is given in Ref. [66], in which it is shown how this can be used 
for a determination of L 7 . For other work using the "extra" quarks available in the partially 
quenched theory to separate connected and disconnected Wick contractions, see Ref. [80( 



for an application to electromagnetic properties of baryons, Ref. [81[ for an application to 



quenched and partially quenched nonleptonic kaon decays, and Ref. [82| in the context of 
the hadronic contribution to g — 2. 



D. ChPT for mixed actions 

QCD with a mixed action is a generalization of PQQCD, because not only do we treat 
the valence and sea quark masses as independent, but also their lattice Dirac operators. 
An important consequence is that mixed-action QCD has fewer flavor symmetries than its 
partially quenched cousin [83| . Revisiting the general observations in support of the validity 
of PQQCD in Sec. IIV B[ we note that observations 1 and 3 still apply, but that observation 
2 has to be modified. It is still true that full QCD is a subset of mixed-action QCD, but it 
is no longer true that one can rotate sea quarks into valence quarks. 



81 



For a much more detailed argument along these lines, see Section 6 of Ref. 66 1. 
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If we would consider a theory with two sets of sea quarks, with each set discretized with 
a different lattice Dirac operator, there would be little doubt that this constitutes a valid 
discretization of QCD, because of universality. One would expect that the framework of 
Sec. Ill II applies to such a theory as well, with the necessary technical modifications. In 
mixed- action QCD, however, one of these sets is quenched, making it less obvious that a 
universal continuum limit exists. But if it exists, it will clearly have to be the corresponding 
partially quenched version of QCD. Hence, if one combines the assumption that PQChPT 
is the correct EFT for PQQCD with the notion of universality of different discretizations, it 
appears reasonable that we can combine the techniques of Sees. II III and IIVBI to construct a 
chiral EFT for mixed-action QCD. 

The relevant symmetry group clearly depends on what discretization we use for the 
valence and sea quarks. For definiteness, let us consider the case that we choose Wilson 
fermions for the sea quarks, and overlap fermions 82 for the valence quarks. 83 The lattice 
symmetry group (ignoring the soft breaking by quark masses, and removing "overall" U(l) 
factors as usual) is then SU(N S ) V x [SU(N V \N V ) L x SU(N V \N V ) R ]. With overlap fermions, 
the symmetry group in the valence sector is the same as in the continuum, but Wilson 
fermions have no chiral symmetry, and the symmetry group is thus the smaller group of 
vector-like transformations only. 

We can now construct the chiral lagrangian (valid in the GSM regime to order p A , and 
in the LCE regime to order p 2 ) by writing down the SET for the mixed-action theory, 
introducing spurions, and making the transition to the chiral lagrangian as we did in Sec. IHH 
and this has been done systematically in Ref. j29j. However, it is rather simple to understand 
the result, so, rather than repeat the whole analysis, I will first quote the result, and then 
comment on the various terms. The result is 

C = £? o Q n ? CD (2B M s -> 2B M' S = 2B M S + al) (4.33) 
+A£ (4) (a -> aP s ) - a 2 W M str(P s SP s S f ) , 

with A£( 4 ) given by Eq. A3 . 15j) . and the subscript cord stands for "continuum." Here P s is 
the projector on the sea sector, defined by P s q s = q s and P s q v = 0, P s q = 0. While this 
equation contains a lot of information, it is easy to understand this result. First, since the 
valence quarks have exactly the same chiral symmetries as in the continuum, there are no 
explicit 0(a 2 ) terms in the valence sector of the chiral lagrangian. The only such terms 
come from the Wilson sea sector, and this explains the appearance of the projector P s in 
Eq. (I4.33p . and the fact that only the sea quark masses get shifted by an 0(a) term, as in 
Eq. (13. lip . Of course, there are scaling violations in the valence sector as well, but they can 
only occur as 0(a 2 ) corrections to the continuum LECs, /, Bq and but in the GSM regime 
these are of order p 6 . 84 Rotational symmetry is always broken on the lattice, but, as we 
have seen before, the effects of that only shows up at higher orders in the chiral lagrangian. 
Essentially, the form of the lagrangian (I4.33P is thus that of the continuum lagrangian, plus 
the symmetry-breaking lattice part for the sea sector. However, one completely new term 
does appear in Eq. (I4.33p . and that is the term with the new LEC Wm- This term is allowed, 



82 Or domain-wall fermions with a very small residual mass. 

83 For the case of overlap fermions on a staggered sea, see Ref. 

84 To the best of my knowledge, the 0(p A ) chiral lagrangian with LCE power counting has not been con- 
structed. 



56 



because there are no symmetries that connect the valence and sea sectors. The new term 
does not break (continuum) chiral symmetry restricted to the sea sector, so a similar term 
also appears if we choose different types of sea quarks, such as staggered or domain-wall 
quarks. Since this new term does not break chiral symmetry, a similar term also arises in 
the valence sector. But since we have that P v = 1 — P s and EE' = 1, we can write 

str^EP^) = str(P s EP s E f ) + str(l) - 2 str(P s ) , (4.34) 

so only one of these terms is independent. If we take the valence quarks to be the same 
as the sea quarks, the projector P s is replaced by the unit matrix, and the new term is an 
irrelevant constant. But with a mixed action this is not the case, and the new term should 
be taken into account. 

Let us consider the 0(p 2 ) expressions for meson masses in the LCE regime, beginning 
with the flavored ones. By expanding the lagrangian to quadratic order in $, we find, 
using now Latin indices for the valence flavors and Greek indices for sea flavors, in order to 
distinguish them more clearly: 

ra?- = B (rni + mj) , (4.35) 



mlp = B (m' a + m' p ) + — (N s W's + W$ 



32a 2 
T 2 



An 2 

ml = PqK + m' a ) + — (4N S W, + 2W^ + W M ) . 

The factors N s in these expressions come from factors str(P s ) which occur working out the 
expansion in <£>. The new LEC Wm only shows up in the "mixed" pion mass, i.e. , the 
pion made of a valence and a sea quark, hence there is no 0(p 2 ) relation between the pure 
valence, pure sea, and mixed pion masses. 85 In any application of QCD with a mixed action, 
the mixed pion mass gives thus an important indication whether one should use GSM or 
LCE power counting. 

In the flavor-neutral sector, all the complications of PQQCD of course are inherited by 
the mixed-action case, including the appearance of double-pole terms, but a new issues also 
arises. In the sea sector, to quadratic order, there is an additional contribution to the m 2 , 
term in Eq. ( I4.20p . coming from the Wj term in Eq. ( J3.15]) shifting the value of m\ only in 
the sea sector (because of the P s projectors, cf. Eq. (14.33}) ) effectively to 



m, 



2 2 96a 

o,eff = m o + -J5 



ml + —^W; . (4.36) 



For a theory with degenerate valence and degenerate sea quarks, one finds for the flavor- 
neutral propagator in the valence sector [85| 



Sjj _ 1 ml(j> 2 + m 2 sea ) + (N s m 2 Q /3)(ml eff - m 2 Q ) 
P 2 + mlai 3 (p 2 + m 2 val ) 2 {p 2 + m 2 sea + N s ml eff /3) 
_ u 1 (m 2 ea -m 2 val )/N s + 32a 2 W^/f 2 



13 N s Jp 2 + m 2 val (p 2 + m 2 val ) 2 



85 In the GSM regime there is, because the 0(a 2 ) terms are of higher order in that regime. In the GSM 
regime we do thus have that rc? mixed = {m 2 sea + m 2 , al )/2 to lowest order. 
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where the second line is obtained by taking ttlq — > oo in the first line of this equation. 

The importance of this result lies in the fact that scaling violations are not universal. 
Naively, one might think that the best way to conduct a mixed-action lattice computation 
is to tune the quark masses such that the valence pion mass is equal to the sea pion mass. 
However, comparison between Eqs. (I4.35P and ( I4.37P shows that this is not the same as 
tuning quark masses such that the residue of the double pole vanishes. If one tunes quark 
masses to make sea and valence pion masses equal, there will still be a double pole in the 
theory, with a residue of order a 2 . Since the double pole can lead to infrared "enhancement" 
effects (such as the enhanced chiral logarithms and other such examples we encountered in 
Sec. HVBj) . such scaling violations could be larger than one would expect based on a simple 
estimate of the size of (ahQCD) 2 - This phenomenon does not occur at this order in the case 
that one uses overlap valence quarks on a staggered sea; because both valence and staggered 
quarks have exact chiral symmetries, in that case the effect can occur at the earliest at order 
MS- 



a m quark 



1. One-loop universality 

As soon as the lattice spacing is large enough that we need to include it as a variable in 
chiral fits, the version of ChPT we need to use will depend in some detail on what kind of 
lattice action is being used. For instance, ChPT for Wilson fermions is quite different from 
ChPT for staggered fermions. This is in general also true when a mixed action is used. This 
is not a problem of principle, but for each new type of lattice action, the necessary ChPT 
calculations need to be repeated. 

It would be very nice if some general prescription existed that allowed one to take a 
calculation carried out in continuum PQQCD, and modify it using some simple rules into 
the appropriate mixed-action ChPT expressions. While this is clearly not possible in general, 
it is possible under some limitations, which however cover many cases of practical interest. 

As we noted before, the use of a mixed action is particularly useful when the valence 
fermions have exact chiral symmetry (broken only by the physical quark masses), while the 
sea sector has much less symmetry. If we allow only valence quarks on the external lines 
of any correlation function of interest, in order to access what we will refer to as "valence 
quantities," and we work to leading order {i.e. , tree level) in ChPT, one clearly needs just 
that part of the 0(p 2 ) chiral lagrangian that refers only to the valence sector — one can 
simply set all sea- meson and mixed- meson fields equal to zero. This is nothing else than 
the observation that sea quarks (and thus sea and mixed mesons) can only appear on loops. 
If the lattice valence quarks have exact chiral symmetry, the 0(p 2 ) chiral lagrangian is the 
same as in the continuum, i.e. , it takes the form of Eq. ( I2.33p . 

What is remarkable is that a similar argument holds at order p A , i.e. , to one loop [86]. 
For one-loop contributions we need only the 0(p 2 ) lagrangian, and, in one-loop diagrams 
with only valence quarks on the external legs, only one sea quark loop can occur, as can 
easily be seen in the quark-flow picture. 86 Let us see how this works in the case that the 
sea quarks are Wilson-like, and that we are in the LCE regime. To order p 2 , the chiral 



For disconnected contributions, cf. the second term in Eq. (|4.37j) . see below. 
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lagrangian is then 



C LO = \f str ((P^E^E) - ^B f 2 str (ME + E^) (4.38) 



-a 2 W' & (str(P,E + E f P s )) - a 2 Wj (str(P s E - E f P 
-a 2 ^ str(P s EP s E + 1$P t YtP s ) - a 2 W M str(P s EP s E 



from Eqs. (I2.33H and ( 13. 1 5ft . where I have inserted the projectors P s , and in which the 0(a) 
term has been absorbed into the sea-quark mass matrix, as in Eq. f)4.33p . As noted, we 
can have only one sea quark on the loop. This implies that if we expand out E in terms 
of $, we will only use those terms in the expansion of any of the 0(a 2 ) operators in which 
two of the indices on the $ fields correspond to sea quarks (these two indices correspond to 
the "beginning" and the "end" of the sea quark loop), with the rest of the indices referring 
to valence quarks. Because of the P s projectors in the 0(a 2 ) terms, such terms only arise 
when we set one E (or one E^) equal to one, in all possible ways in which this can be done. 
Applying this "1-loop valence" rule to Eq. (14.381) . it simplifies to 



C 



LO 



1— loop valence rule 



±f 2 str ((P M E)tP M S) - ip / 2 str [ME + E^M 
-a 2 (AN S W^ + 2W' % + W M ) str (P S E + E^P S ) . (4.39) 



In other words, all dependence (to one loop) on the LECs W$ 8 and Wm can be absorbed 
completely into a shift of the sea-quark mass matrix of the form [83] 

4a 2 

1-loop valence rule: B M S -> B Q M S + — (AN S W' % + 2W' S + W M ) 1 . (4.40) 

Note, however, that this simplified form cannot be used to calculate any quantity involving 
sea quarks! For instance, the leading-order meson mass for a meson made only out of sea 
quarks is given by the middle equation of Eq. (j4.35ft . which does not correspond to shifting 
the sea quark masses as in Eq. (I4.40p ; in particular, the LEC Wm does not occur. It also 
does not apply to the disconnected term in Eq. ( |4.3T[) . in which again the sea meson masses 
are given by Eq. (14.35jl . However, since this is the only place where the sea meson mass 
shows up at one loop in valence quantities, the LEC Wlj can be absorbed into the sea meson 
mass-squared, for the one-loop contributions to valence quantities. 

In summary, the one-loop expressions of continuum PQChPT for valence quantities, such 
as meson masses, decay constants and scattering amplitudes {88|, carry over to the mixed 
case, if one takes the mixed meson mass to be given by the last line of Eq. (|4.35ft . 

4a 2 

m 2 a = PoK + O + ~y ( 4A ^6 + + W M ) , (4.41) 



and the sea meson mass by 



32a 



2 



m 2 ap = B (m' Q + m 'p) + — (N S W^ + W 8 + N S W^) , (4.42) 

by combining Eqs. (j4.35ft and ( I4.37j) . It is only through these masses that the LECs Wg 78 
and Wm occur. 
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In a calculation to order p 4 , there are also tree-level contributions originating from terms 
in the chiral lagrangian of order p 3 and p 4 , cf. Eq. (I3.26p . Single-trace terms containing a P s 
thus do not contribute. In double-trace terms any E multiplied by a P s should be set equal 
to one. That means that such terms do contribute, but again they can be absorbed into a 
redefinition of other LECs. For instance, the W± and Wq terms in Eq. (j3.15p . reduce to 



aW 4 str (^E^E+J str (P S E + TtP 8 ) (4.43) 
-2aB W 6 str (MY) + EM) str (P S E + E f P s ) 
2N s aW A str (^E^E*) - 4JV s a£ W 6 str (AlE* + EM] 



— > 
or 

f^f + ^N S W, , P B (l + ^N S W 6 ) ■ (4.44) 

All double-trace terms in the chiral lagrangian need to be treated this way, leading to more 
redefinitions of the 0(p 2 ) LECs -Bo and f 2 . In this way the redefined LECs B Q and / also 
pick up dependence on the sea quark masses, from the L4 and Lq terms in Eq. (I2.39[) . 
Whether this is convenient or not depends on the application. Note that if one trades sea 
quark masses for sea-meson masses, one has to account for the leading-order 0(a 2 ) shifts of 
Eq. (14.351) . For more discussion, including the example of / = 2 tttt scattering to order p 4 , 
see Ref. Q. 

In summary, what we find is that, to order p 4 , the formulas of continuum PQChPT can be 
used for correlation functions with only valence quarks on the external legs, with a suitable 
redefinition of the mixed and sea meson masses and the 0(p 2 ) LECs. In other words, once 
these redefinitions have been made, the theory does not remember the fact that the sea 
quarks are Wilson like. That means that we uncovered something universal [86]: The same 



argument should also apply when the sea quarks are staggered! And indeed, it does: For 
example, it is straightforward to check from Eq. (13.611) that the shift in M s corresponding 
to Eq. (14401) is 

4a 2 

1-loop valence rule: B M S -» B M S + — (d + 4C 3 + 4C 4 + 6C 6 + C mix ) , (4.45) 



where C mil is the staggered equivalent of W 4 



M 



V. SU(2) VERSUS SU{3): PHYSICS WITH HEAVY KAONS 

So far, we have mostly discussed ChPT for QCD with three flavors to order p 4 . An 
important question is whether this is sufficiently high order for practical applications to 
lattice QCD, and the answer is, unfortunately, that in general it is not. It is a fact that 
the kaon mass is not so small in the real world, and even if ChPT applies to the strange 
quark mass as well, the expansion to order p 4 may not suffice. Indeed, m 2 K /m 2 13, 
suggesting that an expansion that works very well for m 2 may not work so well when m 2 K 
is the expansion parameter. 

One option for dealing with this problem is to go to higher order in the chiral expansion. 
In applications to phenomenology, there has been much work to extend ChPT calculations 
to order p 6 [891 ] . For applications to the lattice, 0(p 6 ) expressions, which involve two- loop 
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calculations (as can be seen from Eq. (I2.18P ). need to be extended to the partially quenched 
case, and ideally, also to include scaling- violation effects. Meson masses and decay constants 
have been calculated in partially quenched, three- flavor ChPT to order p 6 [90], and it will 
be interesting to see what happens if the two-loop chiral logarithms are included in fits to 
lattice data. Scaling violation effects have not been calculated in ChPT to the same order. 87 
While this means that in principle lattice results would have to be first extrapolated to the 
continuum, before chiral fits are performed, it may be interesting to include "what we know," 
i.e. , the continuum part at order p 6 , because the continuum 0(p 6 ) terms are expected to be 
important at larger quark masses, where scaling violations are less important. Up to date, 
something similar has been done by MILC, which included all the analytic terms to order 
p 8 in their mass and decay constant fits, but no logarithms beyond order p A (yet). 88 For an 
early attempt to fit Nf = 2 + 1 full-QCD overlap results to 0(p 6 ) ChPT including two-loop 
nonanalytic terms, see Ref. [91] . 

It has recently been suggested that kaon loops are not a reliable part of ChPT [92], 
because the relevant scale is 2m k ~ 1 GeV, rather than txik itself. Diagrams can be 
reconstructed from their cuts and poles, and, for a meson of mass m, these cuts typically 
start at 4m 2 , and not m 2 . For instance, the contribution of a kaon pair to the imaginary 
part of the tttt scattering amplitude starts at s = Am 2 K \ for other examples (F„ and the 



pion electromagnetic form factor), see Ref. [92]. Of course, 2m k ~ 1 GeV is too large to 



trust a chiral expansion in that scale. However, this appears to be a quantity- dependent 
observation. It applies to purely pionic quantities, since (virtual) kaons always have to 
contribute in pairs to these quantities. But if we consider for instance nK scattering, the 
cut starts at (m, + m^-) 2 m m 2 Kl and the chiral expansion for this scattering amplitude as a 
function of m^ may have better convergence properties. A similar argument applies to Fx- 
For lattice computations relevant for phenomenology, there is a different option. It is 
not difficult to do lattice computations with the strange quark mass adjusted to (close to) 
its physical value, so that only extrapolations in terms of the light quarks masses (m u and 
ma) are needed. In this case, we do not need three-flavor ChPT, but, rather, we can work 
with two-flavor ChPT, treating the strange quark as "heavy." Note, however, that we do 
not integrate out the strange quark, which we cannot do, because m s is not large compared 
to Aqcd- The actual lattice computations still have to be done with three flavors, up, down 
and strange. 

In the pion sector it is straightforward to develop ChPT again, one simply starts from 
the group SU{2)l x SU(2)r, instead of SU(3)l X SU(3)r. 89 But the properties of kaons (its 
mass, decay constant, the kaon B parameter, etc.) will depend on m u and rrid through inter- 
actions of kaons with pions, and since lattice computations will still typically use unphysical 
values for the up and down quark masses, we will still need to know how to extrapolate kaon 
properties in those light quark masses. We thus need to extend two-flavor ChPT systemat- 



87 In the GSM regime, m qua rk ~ a ^QCD' this wou ld be easier to carry out than in the LCE regime 



o 2 A 3 
" qcd ■ 



QCDi 

S8 This means that the fitted LECs at order p 6 from the lattice cannot be compared to any continuum 
results. While this "hybrid" fitting method might also affect lower-order LECs, they are expected to be 
much less sensitive, because their values are predominantly determined by lattice results at lower quark 
masses, where the 0(p 6 ) effects are less important 18 1. 



89 In this section we will use the word "pion" to refer to and 7r°, because we will not think of the kaon 
as an approximate Nambu-Goldstone boson. 
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ically to include the interactions of kaons with pions. How to do this will be the topic of 
this section. Studies using this EFT in application to three-flavor lattice data were recently 
carried out in Refs. 93, While the case of kaons is particularly simple, the basic setup 
also applies to EFTs for the coupling of baryons and heavy-light mesons to pions. 

While this alternative approach may turn out to be a useful approach, it is a more limited 
approach, because the kaons are not treated as Nambu-Goldstone bosons. For instance, 
while the dependence of the kaon mass on m u and is predicted by this EFT, it will 
not go to zero in the chiral limit. Instead, the chiral-limit value of the kaon mass is a new 
parameter in the EFT. Another example is that in this approach, one cannot ask certain 
questions, such as the value of the three- flavor condensate in the chiral limit. This particular 
question is interesting, because it has been argued that if this condensate were very small, 
that would lead to the need to redefine the power counting of ChPT, and thus rearrange 
the chiral lagrangian order by order 95|]. To answer this question, one needs to work with 
the three-flavor theory, and vary the strange mass. In the setup we will investigate in this 
section, one only gets to ask about the two-flavor condensate, at the physical value of the 
strange mass. 90 



A. Including a kaon in two-flavor ChPT 

We need to develop two-flavor ChPT for the three pions, coupled to the isospin-1/2 kaon. 
In the special case of the kaon, it should be possible to deduce the results from ChPT with 
three flavors. For instance, one may take the result for fx/fw i n Eq. (I2.40p . and expand it 
in the light quark mass for fixed strange quark mass. One then absorbs the strange quark 
mass dependence into the LECs, and this parameterizes the dependence of fx/ fn on the 
light quark mass, while giving up information about the dependence on the strange quark 
mass. However, this method is not available for other hadrons such as baryons, and it is 
useful and instructive to develop a method that does not rely on three-flavor ChPT. After 
developing the method, we will return to the matching with three-flavor ChPT in some 
examples. 

The systematic method for coupling pions to a non-Goldstone hadron was worked out a 



long time ago in Ref. [96J, which shows how to couple any field with a given isospin to pions 
in the most general way. In fact, the method generalizes to other groups as well: The general 
case is that of some continuous group G that is spontaneously broken to some subgroup H. 
Since only H is realized in Wigner mode, all fields need to transform in representations 
of H, and not G, as we will see in more detail below. Here we will restrict ourselves to 
G = SU(2) L x SU(2) R , and H = SU(2) V , i.e. , isospin. 

We will present the construction as explained in Ref. j97|. As we have seen in Sec. Ill Bl 
the pion fields parametrize the coset [SU(2) L x SU{2) R ]/ SU{2) V (G/H in the general case), 
because 577(2) y transformations leave the vacuum invariant. An element of the coset can 
be represented by picking an element (ul, ur) of the full symmetry group SU (2) l x SU(2)r, 
and defining an equivalence class by making any two such elements equivalent when they 
differ by a transformation in SU(2)y. Each equivalence class is an element of the coset. We 
can pick a "standard" representative of each equivalence class by imposing a condition on 



90 Recent results for both condensates in the three-flavor theory can be found in Ref. [ijj]. It is found that 
while the three-flavor condensate is smaller than the two-flavor condensate, it is not very small. 
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ul and ur; we pick the condition that ul = u R = u. We can do this: by definition, if we 
multiply (ul,Ur) on the right by an element {h) , h)) e £77(2)^, obtaining {u^h) ,11^), this 
group element is in the same equivalence class, and thus represents the same coset element. 
If we pick h such that it solves ur = hu^ L h, this brings the coset element into standard form. 
If we now multiply the group element (ul,ur) by another group element, (vl,vr): 

{u l ,u r ) (v l u l ,v r ur) , (5.1) 

the new group element in general corresponds to a different element of the coset. But again 
we can choose a {h) , h)) e SU(2) V such that 

{u L ,u R ) -»• {v L u L h\vRU R }J) (5.2) 

keeps the transformed element in standard form, where, of course, h = h(u, Vl jR ) now 
depends both on u and (vl,Vr). If we choose (vl,Vr) € SU(2) V , i.e. , Vl — Vr — v , we can 
choose the "compensator" field h equal to v, and it does not depend on u. It follows that, 
as expected, an SU(2)v transformation stays in the same equivalence class. 
We now may define E in terms of u by 91 

E = ulu r = u 2 — > vl^v r , (5.3) 
u = exp(i(f)/f) , 

where now 

because then E also parametrizes the coset, and has the correct transformation properties 
under the chiral group if we identify Vl,r = Ul,r, cf. Eq. (12.111) . 

This construction provides a straightforward way to include the kaon. Introducing a field 
K in the isospin-1/2 representation of SU(2)y, we can extend the nonlinear represention of 
the chiral group to 

u v L uh\u,v LtR ) = h(u,v L}R )uv R , (5.5) 
K ->• h(u,v LtR )K . 

These two fields, their (covariant, see below) derivatives and the sources i^, r M , \ an d X s 
now form the set of building blocks from which to construct the chiral lagrangian. 

Note that we only have to specify the SU(2)v representation of the kaon field K, and not 
the representation of the full chiral group SU(2) l x SU(2) r . There is a simple intuitive reason 
for that Q: Suppose, for example, we specify that a kaon field K' transforms in the (1/2, 0) 
representation of SU(2)l x SU(2)r, K' — > vlK'. Then, performing the field redefinition 
K" = Y)K' = (1 + O(tt)) K' changes the representation to (0, 1/2), since K" v R K". We 
see that the presence of pions (or, in general, the Goldstone bosons associated with G/H) 
give us the freedom to choose any representation of SU(2) L x SU{2) R we want! The field K 



Whether with the symbol "u" I refer to the up quark or the nonlinear pion held should always be clear 
from the context. 
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used in Eq. (15. 5 j) is obtained by denning K = u'K'. 92 We note that the field K' transforms 
under parity to 

K' -> iT (p) = -E^' , (5.6) 

because the parity transform of K' should transform as K'^ — >■ VrK'( p > under the chiral 
group; the minus sign indicates that the kaon is a pseudoscalar. The field K transforms into 
—K under parity, consistent with its transformation rule Eq. ( 15. 5 p under the chiral group 
(h G SU (2)y)- 

In order to construct the chiral lagrangian, we need to construct invariants out of u and 
K, which transform as in Eq. (15. 5p . Since h is local, we will need a gauge connection for 
SU(2)v, constructed from u. We thus introduce 



f d^u , (5.7) 



which transform into each other under parity. Under Eq. (15. 5p . the parity odd and even 
combinations transform as 

A- = £ « - <) h « - «J) fct , (5.8) 



5 « + ^ « + <) - d.hhi . 

We see that A M transforms homogeneously, while the other combination transforms as an 
SU(2)v gauge connection. The latter can thus be used to define a covariant derivative for 
K: 

D^K = d^K+ l - (u^d^u + udpvf) K . (5.9) 

If we wish to include the sources £^ and r M , this can be done by replacing —> <9 M — it^ in 
and <9 M — > — ir^ in u^. 

The chiral lagrangian consists of three parts. First, there is the purely pionic part of the 
lagrangian, which is given by C^+C^ of Eqs. <^M> and (g3S} . For two flavors, a number 
of simplifications occur because of special properties of 577(2). Then, there is the quadratic 
term in the kaon fields that defines the kaon propagator in ChPT, 

C% ] = (D^K^D^K + M 2 K ] K . (5.10) 

Here M is a free parameter equal to the kaon mass in the chiral limit. 

As already noted, even with £ M = r M = the covariant derivative (15. 9p is required, 
because h in Eq. (I5.5P is a local, field-dependent transformation. Interactions between pions 
and kaons are therefore necessarily present in our EFT. In this respect, it is instructive to 
see how this same conclusion would have been reached if we started from the field K' = uK 
discussed above. It looks like we can write down a kaon kinetic term without any interactions 
in terms of this field: d^K'W^K' is invariant under the chiral group. However, this is not 



92 It can be shown that any other nonlinear representation of the group SU(2)l X SU(2)r with the same 
field content can be brought into the form (|5.5p by a field redefinition [96(. 
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invariant under parity. Using the parity transform K'^ introduced in Eq. (I5.6p . a lagrangian 
invariant under the chiral group and parity is 

~ (d^d^K' + d^K^dptpK')) , (5.11) 

which, using K' = uK, is equal to the first term in Eq. (I5.10p . modulo a term proportional 
to K^L^L^K = 2tr(A tl A^)K j: K that can be absorbed into the Ai term in Eq. (15.141) below. 

Many more terms involving pion-kaon couplings can be constructed. For a more extensive 
discussion of building blocks and possible terms in the chiral lagrangian, I refer to Ref. 97 
Here I give only one more building block: 
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X+ = u ] xu ] + u\ ] u , (5.12) 

which is even under parity (the odd combination, with a minus sign instead of a plus sign, 
appears in the chiral lagrangian at order p A ). 

With these definitions, noting that A^ = ^vlL^u, with = £<9 M £t (cf. Eq. (I2.16P ). 
Eq. (I2.33P can be written as 

£ (2) = ~\f tr(A M A M ) - \f tr(x + ) , (5.13) 
and more terms involving the kaon field are given by 

= A 1 tr(A^) K^K + A 2 tr(A M A„) {DpK?D v K (5.14) 
-A 3 K^ X+ K - A A tT(x + )rfK 

+A' 2 tr(A„A„) (D^D^K + A' 3 (D ^ X+ D + A' A tr( X+ )( J D At ^) t ^^ , 

where ^1,2,3,4 and A' 2 3 4 are seven new LECs. 94 Formally, we see that and C% are the 
first two terms in a small-derivative/light quark mass expansion, because only derivatives 
on the pion field u should be counted as small, and not those acting on the kaon field K. 
We note that some of the terms in would be counted as 0(p 4 ) if the kaon is treated as 
a Nambu-Goldstone boson, cf. Sec. [TT1 



B. Power counting 



At this point, we need to revisit power counting. First, there are now three scales in the 
problem, m n , m K and 47r/, cf. Sec. Ill C\ and we are now not assuming that m 2 K / \An f) 2 is 
small. Therefore, it is not obvious that a systematic chiral expansion in m^./(47r/) 2 and/or 
m^/mj^ can be set up. Second, our new lagrangian also contains interactions contributing 
for example to the scattering process 7r + 7r~ — > K + K~ . But, clearly, with the kaon mass not 
treated as a small parameter, the initial-state pions in this process also do not have small 
momenta in the sense of a chiral expansion. The prediction for this process from our new 
lagrangian can thus not be argued to be the leading order in a systematic expansion. 



93 Note that conventions in Ref. |97| differ from those used here. 

94 The primed A's are not present in of Ref. 97 1, and can probably be traded for higher-order terms by 

(2) 

field redefinitions. However, this is not mandatory, and I choose to keep them in C K . 
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FIG. 9: One-loop contribution to the kaon self energy with a K*tt intermediate state. 



We conclude that "kaon ChPT" cannot be used for any process near or above the two-kaon 
threshold. However, it can be used to calculate correlation functions in which a kaon goes 
in and comes out, so that the large energy residing in the kaon mass does not get converted 
to pions. This means that one can use the new lagrangian to calculate the dependence of 
kaonic quantities, such as its mass and decay constant, on the light quark masses m u and 
Note that in the leptonic decay of a kaon, its mass does get converted into the energy 
of the outgoing leptons, but that does not affect the calculation of its dependence on light 
quark masses. 95 

While this claim turns out to be correct, we still need to deal with the first problem 
mentioned above, i.e. , the appearance of a new "large" scale m^, in order to show its 
validity. From Eq. (15.101) it follows that the parameter M 2 only shows up in the kaon 
propagator. It can therefore only get promoted to the numerator of some contribution to 
a physical quantity by appearing in a loop. The simplest case is that of a closed kaon 
loop; for instance, the kaon tadpole diagram contribution to the pion self-energy that would 
follow from + . In dimensional regularization this would give a contribution of order 
m 2 K log (m 2 K /A 2 ), which is not small. However, such contributions can be absorbed into 
the low-energy constants of the pion lagrangian, because they are independent of the pion 
mass. 96 We conclude from this example that we can thus omit closed kaon loops from the 
calculation of correlation functions with kaon ChPT. 97 With this rule that we omit closed 
kaon loops, we conclude that correlation functions with only pions on the external legs 
should be calculated with the pion chiral lagrangian only. 

In effect, because the kaon is heavy compared to the pion, we have integrated it out, and 
including closed kaon loops as discussed above would amount to double-counting. This is 
of course the reason that we can use two-flavor ChPT for pion physics well below the kaon 
mass. The new element is that we can also consider processes with a kaon going in and 
coming out — as long as no kaons get annihilated or produced, the energy stored in the 
kaon mass does not play a dynamical role. An example is ttK — > ttK scattering: As long 
as the energy of the incoming pion is small, the energy of the outgoing pion also has to be 
small, and two-flavor ChPT (in the presence of a "heavy" kaon) should apply. For a detailed 
analysis of this scattering process, see Ref. |97 . 



95 Using kaon ChPT for semi-leptonic decays only works when the momentum of the outgoing pion is small. 

96 Similar to the way that the physics of p mesons and other heavy hadrons appears through the values of 
the LECs in the pion lagrangian. 
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The same argument was used for baryon ChPT already in Ref. [98 1. 
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There are also loops with internal kaon lines where the kaon does not form a closed loop, 
for instance in the just-mentioned example of kaon-pion scattering. To one-loop order, they 
do not occur in loop corrections to pion and kaon masses and decay constants, because all 
one-loop diagrams for these quantities are tadpole diagrams, which follows from the fact 
that there are no three-meson vertices in our EFT. However, such vertices do occur in the 
kaon theory if we also include the K*, described by a vector field also in the isospin-1/2 
representation of SU(2)y. In this extended version of kaon ChPT, a possible invariant is 

- igfKl(u%u)K = gK\Kd^ + • • • , (5.15) 

i.e. , now the theory does contain a K*K7r three-meson vertex. 

Let us consider an example. The vertex (15.151) leads to a correction to the kaon self 
energy through the diagram of Fig. [9j The amputated part of this diagram stands for an 
integral of the form [93[ (using dimensional regularization) 



(2tt)<* ( g 2 + m 2 )((p _ g) 2 + M 2) 



(5.16) 



in which M* is the mass of the K* . We need this integral for an onshell kaon, so we need 
to set p = Mv, with v a four- vector with v 2 = —1. The denominator of the K* propagator 
then becomes 

q 2 -2Mv q + M 2 - M 2 . (5.17) 

Nonanalytic dependence on m n comes from the region q ~ m n . The behavior of the integral 
now depends on the various other scales. If M* and M were the masses of spin-zero and 
spin-one heavy-light mesons (instead of the K* and the kaon), the term linear in M would 
be the dominant term, because M 2 — M 2 ~ Aq CD an d M* ~ M ^> Aqcd- 98 Instead, for the 
KK* system both masses as well as their difference are of order Aqcd, and the difference 
M 2 — M 2 is the dominant term. This leads to a different infrared behavior for each of these 
cases. For the heavy-light case, the denominator (15 . 1 7\i can be of order m n (times M), 99 
while for the kaonic case, the denominator does not become small. In the latter case, we 
can thus effectively replace this denominator by Aq CD , and read off the infrared behavior 
from the rest of the integral, which gives rise to a contribution of order m^.log(m 2 /A 2 ) 



93j . The power m\ has to occur because of dimensional analysis, and the fact that the 
kaon propagator, even though it appears inside a loop integral, can be "power-counted" as 
order Aq CD . I would expect this type of argument to extend to higher loop contributions, 
thus explaining how higher-loop contributions lead to nonanalytic terms at higher orders 
in m 2 . F or a n extensive discussion of all this in the context of baryon ChPT, I refer to 



Refs. m, 101 



It is straightforward to show that there are no m 2 log(m 2 /A 2 ) terms coming from C$ + 
Cfr, and thus 97 



m 2 K = M 2 — 2(A 3 + 2A 4 )m i + 0(m 2 \og(m e )) . (5.18) 



'IS 



Of course, Mv ■ q can be smaller than Aq C£) , but the q integral gets cut off in the infrared by m^. The 
situation is more complicated if there are also other infrared scales present, such as mo mentum transfer 
in a form factor or scattering amplitude. For detailed discussions, see for instance Refs. 
99 This leads to the terms of order which appear in the heavy-light and baryon cases. 



99 



100 1 . 
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This is consistent with the result for m? K shown in Eq. (I2.40p . which does not contain a 
term of the form m\ log(m^/A 2 ). 100 It should be noted that Ref. j9?| obtains this result in 
the theory without the K* . Indeed, since the K* is significantly heavier than the kaon, we 
should be able to integrate it out. Doing this leads to an effective coupling of the form 

g2 W2 KlA ^ K = a2 2jp tr(A M A M )#tK (5.19) 

(where we used that A M = —u^d^u is an element of the Lie algrebra for SU(2)), and we see 
that this is of the form of the first term in Eq. (I5.14p . 

C. An application: decay constants 

The pion decay constant follows simply from Eq. ( I2.40p . by expanding out L(m 2 K ) in rhf 
U = f fl + ^B m e - 2L(2B m e ) + 0(m 2 e ) J , (5.20) 



in which we can match the LECs /' and l\ to their three-flavor counterparts 12 



h = 4(L 5 + 2L 4 ) - 



(4tt) 




f = f\l- L(B m s ) + B m s ) , (5.21) 

1 



The matching formulas ( 15.211) only make sense if m s is small enough for three-flavor ChPT to 
apply; otherwise /' and U should be treated as two-flavor LECs that depend in an unknown 
way on the strange quark mass. The expressions (I5.2ip give the first few orders of the 
expansion of /' and U in m s . It is possible that the strange quark mass is so large that 
this expansion simply does not converge. But it is also possible that this expansion does 
(asymptotically) converge, but too slowly for the expressions f)5.2ip to be of practical use. 
In that case, one should either use kaon ChPT, or three-flavor ChPT to higher order than 
p A . Whether the expansion in the strange quark mass converges to some order in ChPT or 
not is quantity dependent. 

In order to obtain the expansion of fx in terms of mi, we can follow two routes. One 
is to simply use the three-flavor result for fx in Eq. f)2.40p . and expand in rri£, as we did 
above for /„.. However, it is instructive to see how one obtains this expansion in kaon ChPT 
independently of the three-flavor ChPT calculation [93]. In order to do this, we need to 
translate the currents 

J L , R = Ql,r1^ l>r , (5.22) 
q = (u d) , 

into the appropriate operators in kaon ChPT. These currents now do not correspond to 
the Noether currents of (softly broken) symmetries, because we are not assuming that the 



100 



Expanding L(mz) in m'^ T /m 2 K gives rise to analytic terms in only. 
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strange quark is small. Therefore, their translations into kaon ChPT do not follow from the 
lagrangian + . We thus introduce spurions Hl,r into these currents, with transfor- 
mation rules such as to make these currents invariant under SU(2)l x SU(2) r : 



J i ,R = QL,R H L,Rlt,s L ,R , with H LR v L , R H L)R , (5.23) 

in which Vl jR G SU(2)l^ r . Making the transition to kaon ChPT, the first two terms in the 
expansion in terms of derivatives of the pion fields are 

J* = C^D^uHr + iC 2 K^A^uH R , (5.24) 

where are new LECs (not present in the lagrangian). The left- and right-handed currents 
are related by parity. In these ChPT expressions for the currents, one can now set R = 

, or Hl,r. = (?) in order to pick out the charged or neutral kaon, corresponding to 



oy vi. 

picking out the up or down quark in Eq. ( I5.23p . respectively. Taking the matrix element of 
the axial current Jf} — Jjl between a kaon state and the vacuum, one finds that 

fx = 2C X (l + j 2 m, - h(2m e ) + 0(m 2 t )^ , (5.25) 

where C is a linear combination of LECs in the kaon EFT lagrangian. 101 Note that the LEC 
C\ is unrelated to /' in Eq. (I5.20p . because there is no SU(3) symmetry in kaon ChPT. If 
we do use SU(3), we may again match the LECs in Eq. (15.251) to the three-flavor ChPT 
result, from which we find 



1, / 8(2L 



d = ~f I 1 - 3L(m s ) + ^ m s ) , (5.26) 



^8(^ + ^)-^(lo g (fj) + l). 

As it should, the coefficient of L(m 2 ) in Eq. (I5.25j) matches that following from Eq. (I2.40p . 

We close this section with another simple example of the translation of a weak operator 
into kaon ChPT j93|. The AS = 2 operator {si'j^d^isL'y^dL) can be written in SU(2) L - 
invariant form by again using the spurion Hl as in 

(s L ^Hlq L )(s L ^Hlq L ) -> B(H{uK)(H\uK) , (5.27) 

where in the second step I gave the leading-order translation into kaon ChPT, introducing 
another weak LEC B. Finally, one sets H^ L = (0 1), because for this choice H^qi = di- 
This technique for finding the ChPT representations of electro-weak operators is of course 
not special to kaon ChPT, and the same method can also be used in three-flavor ChPT Q. 

It turns out that SU(2) ChPT can also be applied to semi-leptonic and nonleptonic kaon 
decays, despite the fact that the final-state pions can have an energy up to half the kaon 
mass. This is not at all obvious, because in the S U(2) fram ework the kaon mass is not a 
small parameter. For more on this topic, see Refs. 1021 . Il03 



101 It turns out that C2 does not contribute to this order. 
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VI. FINITE VOLUME 



As a final topic, I would like to touch, rather briefly, on the fact that all lattice QCD 
computations are, necessarily, performed in a finite spatial volume, and with a finite maximal 
extent L 4 in the (euclidean) time direction. 

Usually, the time dimension of the four-dimensional euclidean volume is chosen larger 
than the spatial directions; large enough so that only the lowest (or lowest few) intermediate 
states contribute to the correlation functions of interest at large times. In order to mimimize, 
or eliminate, systematic effects from the finite extent in spatial directions, the conceptually 
easiest method would be to choose the spatial volume of a cubic box V3 = L 3 (where L is the 
linear size of the spatial box, and we will assume periodic boundary conditions for all fields), 
sufficiently large that one may ignore the effects of L being finite. However, the cost of 
numerical computations grows rapidly with L, and one would thus like to choose the volume 
just large enough that finite-volume effects can be neglected, but not much larger. The 
question then arises whether we have any theoretical insight into what "just large enough" 
means. 

In fact, as we will see, it is possible to develop systematic expansions which provide quan- 
titative information on finite- volume effects, without the introduction of any new parameters 
in the chiral lagrangian. This allows us to quantitatively estimate the effects of working in a 
finite volume, and even to make use of finite- volume effects as a probe of hadronic quantities 
such as the pion decay constant and the chiral condensate. 

The reason that again ChPT gives us the tool to develop these systematic expansions is 
that pions 102 are the lightest hadrons. Thus, if we imagine reducing L from some very large 
value, the pions feel the effects of being in a finite volume first. Therefore, ChPT should 
give us access to the dominant finite- volume effects on many hadronic quantities, if we know 
how to apply it in this setting. 

It already follows from this qualitative discussion that the "figure of merit" for what 
"large" or "small" volume means is the linear size of the box in terms of the wavelength 
of the pion, z = m n L. Clearly, large volume means z>l. But only a more quantitative 
analysis can tell us whether this means, say, z > 3, or whether maybe z > 10 is needed. 

The key theorem in setting up fini te-v olume ChPT states that the lagrangian to be 



used is the infinite- volume lagrangian [104|. The argument goes as follows. First assume 
that the spatial volume is infinite, but that the euclidean time extent L 4 is finite, and 
that the pion fields (15.41) obey periodic boundary conditions in that direction. The pion 
effective theory with these boundary conditions describes the theory at a finite temperature 
T = I/L4. The important observation here is that to calculate pion correlation functions at 
finite temperature, one uses the same chiral lagrangian as at zero temperature (i.e. , infinite 
L4). The temperature, and thus the dependence on L4 enters only through the boundary 
conditions. 103 But, if this is the case, this then also has to be true for the dependence on 
the three s patia l dimensions of the box, if we use periodic boundary conditions also in those 
directions [105 |. In other words, the LECs in a finite- volume calculation should be taken 
the same as in infinite volume. 

This result makes sense from the EFT point of view. If we remember that the LECs 



102 In this section, I will use the word "pions" to refer only to 7r ± and 7r°, and assume that their masses are 
equal, i.e. , that isospin is conserved. 
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For field theory at finite temperature, see the lectures by Owe Philipsen at this school. 
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originated from integrating out high-energy degrees of freedom, they can pick up exponen- 
tially suppressed finite- volume corrections of order exp(— EL), where E is the energy of the 
integrated mode, relative to their infinite-volume values. If we take for example E = m p 
and L = 2 fm, exp(— EL) = 4.5 x 1CT 4 , which is very small in comparison with the present 
precision of our knowledge of LECs. We conclude that we can safely ignore such effects, 
and thus take the LECs to be those of the infinite- volume theory. 104 This more intuitive 
argument also shows that the theorem should hold for more general boundary conditions, 
such as anti-periodic ones. 

It follows that the way in which the volume dependence enters the Feynman rules is 
through the propagator, which now is periodic, since it solves, for particle with mass m„ 

{-D+ml)G L {x-y) = 5{x-y) =nE^-^ + n A) > O 3 - 1 ) 

where L M is the linear extent of the volume in the /i direction, and S is the periodic delta 
function, with period L^ in the \i direction. The solution is, of course, 

G L (x-y) =J2G OQ (x-y + nL) , (6.2) 

with Goo the infinite- volume propagator, and in which nL is the four-vector with components 
n^L^. In words, finite volume effects occur because pions can "travel around the world," 
multiple times in each direction, in a world that is a box with periodic boundary conditions. 
Because of the periodicity, momenta are quantized, p^ = 27rn ll /L fl , with n M integer, and the 
propagator can also be represented by 

Gl{x ~ v) = l-lll^^T^^ (6 - 3) 

where the sum is over the discrete momenta. Feynman rules for vertices are the same as in the 
infinite-volume theory. With these rules, one may extend the calculations of meson masses, 
decay constants, and other quantities to include finite- volume (and/or finite-temperature) 
corrections. We will return to this in Sec. IVI Al 

However, before we extend ChPT calculations to finite volume in this way, we need to 
address a serious physics issue: it is well known that in the chiral limit, the chiral condensate 
vanishes in a finite volume. In order to obtain a nonvanishing condensate, one should take 
the chiral limit after taking the infinite-volume limit, and not before; these two limits do 
not commute. But, if there is no chiral symmetry breaking, that brings into question the 
whole framework of ChPT! 

Let us see what happens in the chiral theory at zero quark mass in finite volume. The 
condensate can be found from the logarithmic derivative of the partition function with 
respect to x}\ 

2B ( dZ 



(QrQl) = =- tr 



= -i/ 2 Bo(tr(£)>, (6.4) 
x=o 4 



104 A corollary is that the chiral theory at nonzero temperature only makes sense for small T, so that 
exp(— E/T) is small. 
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where in the second step we made the transition to the chiral theory. We can decompose 



E(x) = U exp(2i<f)(x) / f) , (6.5) 

with U a constant SU{Nf) matrix, and in which does not contain any constant mode, 
i.e. , / d 4 x(j)(x) = 0. 105 The matrix U represents an element of the coset [SU(Nf)i, x 
SU(Nf)n]/ SU(Nf), and varying U over the group SU(Nf) covers the vacuum manifold, 
cf. Sec. Ill Bt Since U represents an SU(Nf)i symmetry transformation, it follows that for 
vanishing quark masses, the chiral lagrangian does not depend on U. This, in turn, implies 
that the condensate H6.4[) vanishes, because 

dUU = 0, (6.6) 

with dll the Haar measure on the group SU(Nf). 

What we see is that ChPT reproduces the correct behavior in finite volume. The intuitive 
insight is that as long as the volume is large compared to the typical hadronic scale, L ^> 
1/Aqcd, QCD is nonperturbative, and hadrons form. The lightest hadrons are the pions, 
and all that happens if we lower the value of z = m n L is that the pions get very distorted. 
However, if this is the correct picture, ChPT should still be the correct effective theory. 
Only when we make KqcdL too small QCD becomes perturbative, and ChPT is no longer 
the appropriate effective theory. As long as Aq C dL 3> 1, ChPT is the appropriate EFT for 
the low-energy physics of QCD, even if L is small in units of m^. 

When we turn on quark masses, the chiral condensate no longer vanishes. For a degenerate 
quark mass m and a zero mode S = (1/V) / <i 4 xtr(£), the size of the leading-order mass 
term in the chiral lagrangian is of order 

Nff 2 B mVE , (6.7) 

with V = L 4 the four- dimensional finite volume, if we take our volume to be a four- 
dimensional box with equal size in each (euclidean) direction. We may now consider 
different regimes. If we keep Eq. (16. 7p fixed, we choose m ~ 1/L 4 , or , equivalently, 
= y/2Bom ~ 1/L 2 ; we will refer to this regime as the "e-regime" |l06l |. 106 This is a 
different regime from the large-volume regime we considered in Sec. [TTl in which we found 
that m n ~ p ~ 1/L, and thus m ~ 1/L 2 ; this regime is usually referred to as the "p- regime." 
In the e-regime, with e = 1/L, we have that p ~ e, while m n ~ e 2 < p, while in the p-regime 
they are of the same order. 

The kinetic term of the chiral lagrangian, J d A x | tr (d^(f)d^(f)), suppresses the nonzero 
modes 0, and limits the fields 4> to be of order 1/L. If we expand 



U exp(2z0//) = U 1 + -<j> - -0 2 + . . . , (6.8) 




Here I have arbitrarily placed U on the left of exp(2i<j>(x) / f) . More generally, we can write = 
U L exp(2i(/)(x)/f)U R , but if we set U L = UU^ R , we have that S(x) = Uexp(2itf R (j>(x) U R /f), and U R <j>U R 
does not contain a constant mode if <p does not. 

In this small four-dimensional volume, this basically constitutes our definition of the pion mass, as we can 
clearly not define it as the parameter characterizing the large-i behavior of the euclidean pion two-point 
function. 
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it is easy to see that, with m u = ma = m, the first term in 

J d 4 x ^f 2 mB tr(S + S f ) = ^f 2 B mVtr{U + U ] ) (6.9) 

- Jd*x^ mB (tr(U + U ] )<j?) + O(0 4 ) 

is of order L° in the e-regime, while the second term is of order 1/L 2 . To leading order, U is 
nonperturbative: The zero modes in E do not have a gaussian damping, and can thus not 
be taken into account through standard perturbation theory. When we increase the quark 
mass from m ~ 1/L 4 to m ~ 1/L 2 , the first term in Eq. (16. 9p grows to be of order L 2 , 
still dominating the second term, which is now of order L°. This forces U to approach 1, 
making also the zero mode perturbative. We thus make the transition from the e-regime to 
the p-regime. This is also reflected by the behavior of the p = mode contribution to the 
pion propagator (16.31) in finite volume, which, taking all L M = L, is equal to l/(L 4 m 2 ). For 
m 2 = IBtfm ~ 1/L 4 , this is of order one, whereas all other momentum modes contribute 
terms of order 1/L 2 . When m n is of order 1/L, also the zero- momentum contribution is of 
order 1/L 2 , and it does not dominate the propagator. 

There is thus a region in which the chiral expansion in the e-regime coincides with that 
in the p-regime. Both representations have to agree for m ~ 1/L 2 , making it possible to 



match correlation functions between the two regimes |106l |. It is only in the p- regime that 
the Feynman rules in infinite volume, where only the propagator (16 .2p knows about the 
finite- volume, apply. 

Next, we will consider each of these two regimes separately. I will only consider the finite- 
volume physics of pions, because in the real world they are much lighter than kaons, and 
this is now also the case in state-of-the-art lattice QCD computations. Therefore, finite- 
volume effects for kaons and other hadrons will be dominated by their interactions with 
pions. To put it differently, the kaons are always in the p-regime (tuk ^ 1/L; for example, 
for L = 2 fm, m^-L 5), and in the p- regime finite- volume effects due to kaons traveling 
around the world are much suppressed compared to those due to pions. 107 



A. p-regime 

In the p-regime, it is quite straightforward to adapt the infinite-volume expressions of 
ChPT to finite volume. Here, for definiteness, we will take a spatial volume V3 = L 3 , with 
periodic boundary conditions, while, for simplicity, we will assume that the (euclidean) time 
extent is large enough to be taken as infinity. We will also assume that only finite-volume 
effects due to pions are significant. 

Finite- volume effects start occurring at one loop, because in the loops the pion propagator 
gets replaced by its finite- volume version, Eq. (16. 2p . For quantities like meson masses and 
decay constants, cf. Eq. (I2.40p . all one- loop integrals are tadpole integrals, for instance of 
the form 

G-(0) = / 7^ ^ ^ ( A + log U finite) , (6.10) 



(2tt) 4 p 2 + m 



107 



For an investigation including both pions in the e-regime and kaons in the p-regime, see Ref. 107]. 
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where the second equation holds using dimensional regularization, and, of course, we have 
to stay away from the pole at d = 4. In finite volume, this gets replaced by 



Using that 



Gl(0)=G oo (0) + (G l (0)-G oo (0)) . 



I ffl . 

Goo{x) = - - ^- K x {m^x 2 ^ 



4vr 2 



(6.11) 



(6.12) 



in which K% is the modified Bessel function of the second kind of order one, we find that 
G L (0) - Goo(0) = E G oo(x = (nL, 0)) - G^O) (6.13) 

(nL,0)) 

1 



X 



EM 



n. 
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47r 2 



Ki{y/nm^L) . 



Here fc(n) counts the number of vectors n with integer- valued components of length y/n. For 
instance, fc(l) = 6, k(2) = 12, fc(3) = 8, fc(4) = 6, etc. Note that, as one would expect, the 
UV divergence of G%(0) is the same as in infinite volume, because finite- volume effects come 
from pions traveling a "long" distance. Of course, in the one-loop calculations leading to 
Eq. ( 12.4011 . we also encounter integrals such as Eq. (I6.10p with an extra p 2 in the numerator. 
Here we can use that, for x ^ 0, 



dp p e 



2 ipx 



(2tt) 4 p 2 + ml 



d 4 p 

(2tt) 4 p 2 + ml " _ 4^ 



1 ml 



^(mWx 2 ) . 



(6.14) 



We thus fin d, for example, for the finite-volume pion and kaon masses and decay constants 



to one loop [104 



mz 



^ 4fc(n) , _ 



1 + 



(4^/^)2 ^ ^fnm^L 
,2 24 



(6.15) 



A(L) = /Jl-2 
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where 77%^ and f n are the pion mass and decay constant in infinite volume. The kaon mass 
does not receive any finite-volume corrections due to pions, as we can infer from the fact 
that the infinite-volume one-loop contribution has no contribution from pion loops (but 
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only from an 77' loop, cf. Eq. (I2.40p ). In the second of each of these equations, we used the 

asymptotic expansion K\(z) ~ yjn/(2z) exp(— z). The dominant corrections do not come 
from higher orders in the asymptotic expansion of K%, but from the term n = 2 in the sum 
in Eq. (I6.15p . For m n 300 MeV and m n L « 3, the size of the corrections in Eq. (I6.15P is 
of order one percent; making L smaller the corrections grow rapidly in size. Detailed studies 
of finite-volume effects for the pio n and kaon masses and decay constants as well as the r\ 



mass have been m ade i n Ref. (I08j using a partial resummation of hig her l oops based on a 



different approach [109 



TTol . 
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and, for the pion mass to two loops in ChPT 
Many investigations of finite- volume effects in the p- regime have been carried out, using 
both ChPT and other techniques, and it is beyond the scope of these lectures to cover this 
topic in any more detail. However, there is one application in which ChPT can play an 
uniquely important role, and that is the effect of partial quenching on the study of two- 
particle correlation functions in a finite box. 

In a finite volume, the energy spectrum of QCD is discrete, and that also holds for two- 
particle (and many-particle) states. In fact, the energy levels of two-particle states in finite 



volume give information on scattering phase shifts in infinite volume The intuitive 

idea is the following. The normalized wave function of a particle in a box of dimension L 3 
is of order L~ 3 / 2 . If two such particles are put inside the box, and they interact, one thus 
expects a shift in the two-particle energy of order the product of the interaction strength 
time overlap of the two one-particle wave functions, which is of order 1/L 3 . Therefore, the 
dependence of the two-particle energy shifts on the volume gives us information about the 
interacti ons b etween the two particles. Unitarity plays an important role in the systematic 
analysis (ill ], making it not straightforward to generalize these ideas to PQQCD or QCD 
with mixed actions. However, ChPT gives us a handle on this problem, since, as we have 
seen, it can be extended (in euclidean space) to the partially quenched and mixed-action 
cases. The two-particle correlation functions, from which in the unitary case the two-particle 
energy levels are determined, can thus be calculated in PQChPT (or mixed-action ChPT). 
One finds that the double-hairpin contribution to flavor-neutral propagators gives rise to 
unphysical pheno mena such as enhanced finite- volume effects that can be estimated using 



ChPT [58|, |73|, [85|, Ill2| . More study of these intricacies would be interesting. 



B. e-regime 

In this section, we will restrict ourselves to two flavors, under the assumption that the 
strange quark is not in the e-regime. It can thus be taken into account for instance by the 
method described in Sec. |V] Alternatively, one can start with Nf = 3 ChPT, but keep all 
fields describing mesons containing strange quarks in the p-regime. 

Since in this section we will restrict ourselves to two flavors, S G SU(2), and instead of 
the parametrization ( 16. 8 p it is easier to use 

(6.16) 



a 
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For earlier work, see these references. 
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with t the Pauli matrices, and J d 4 x 7ii(x) = 0. With m u = irid = m, V = L 4 , and using 
Eq. (I6.16p . we expand the chiral lagrangian to second order in the pion field: 

Jd A xC = -^f 2 B mV tr(U + U ] ) (6.17) 



1 f \ 1 

+ - J d A x d^n ■ d^n + - B m n ■ jf tx(U + U ] ) 



+ 0(tt 3 



In the e-regime, as we have seen already, the first term and the kinetic term are of order 
one. This makes the mass- dependent 0{n 2 ) term a term of order 1/L 2 , and the whole 0(p A ) 
lagrangian ( I2.39|) of order 1/L 4 . At this point, we observe several differences between the 
p-regime and the e-regime. First, in the p-regime, finite volume effects are exponentially 
suppressed, while in the e-regime, they are suppressed by powers of l/(fL). Then, we also 
see that the chiral expansion in the e-regime rearranges itself: the LECs Li only show up 
at next-to-next-to-leading order in the expansion. To leading order, we thus find for the 
partition function, as a function of the quark mass: 

Z(m) = J [dE] exp (-f 2 B mV tr(U + U^) - - J d 4 x ■ . (6.18) 

Higher order terms in Eq. ( 16.171) can be expanded out from the exponent in the integrand, 
and thus be taken into account systematically. The notation [eE] indicates the SU(2)- 
invariant Haar measure for each £(#). 

In order to proceed, we need to split the measure [d£] into an integration over U and an 
integration over jr. Following Ref. [113], we insert 



J dH Q dH l dH 2 dH i 5^H-yJd 4 x £(x)) (6.19) 



Ii 'dh <[(" S{hU-— I iV.r v:(.r 



H = Hq + zt-H 



where, with h = \H\ = ydet(H), we can write H = hll with U G SU(2), and with dU the 
Haar measure on SU(2). Now we transform variables S = C/E, so that 

£ = cr + i —j- t • 7T , (6.20) 
and use the fact that <i£ = gE to obtain 

Z(m) = fdU I [dt] f[S I d 4 x TCiix)) (6.21) 



i=l 



J x a(x s 



x exp (^f 2 B mV tr(U + u] )~\j d ^ x 9 vtf -9^+3 log J d 

= M JdU exp (^f 2 B mV tv(U + U r ) j , 

where M is independent of the quark mass m. Carrying out the integral over U with 

U = cos 6 1 +i sinO n ■ f , \n\ = 1 , < 9 < ix , (6.22) 

dU = — dQ(n) sin 2 (9) d9 , 
Air 2 
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we find 

Z{m) = ^j^y h (f 2 B mV) , (6.23) 

where I\ is the modified Bessel function of order one. From this, we find for the chiral 
condensate 

/-\ 1 dZ , 2p f l[(f 2 B mV) 1 \ 



m— 



Consistent with our previous discussion, the condensate vanishes for m — > 0. For m^fl fixed 
and V — > oo, Eq. f )6.24p reproduces the infinite- volume result (qq) = —f 2 B , cf. Eq. ( 12.35!) . 
Note that the dimensionless variable f 2 B mV can take any value; to be in the e-regime, the 
requirements are only that f 2 L 2 ^> 1 and BqitlL 2 <C 1. This does not restrict the size of 
their product. 

Equation (I6.24p provides an example of a ChPT calculation in the e-regime to lowest 
order. Starting f rom Eq. (16.171) . it is in principle straightforward to go to higher orders in 



l/(fL) [1131. 1141 ]. For recent work discussing the ChPT calculation of the chiral condensate, 
see Ref. [IlBfT - 



For instance, it is rather easy to see that the m 7 r • tt term in Eq. (" 16 . 1 7[) leads to a 



renormalization of the first term in that lagrangian by [113 



p - 1 - ^vf rf4 ^» ■ = 1 - ^7 E ^ = i - ' (6 - 25) 

where j3\ is a numerical constant |l!3| . and we used that = with integer. 109 

As we have noted before, the 0(p 4 ) LECs Lj only come into play at order 1/L 4 , i.e. , 
at next-to-next-to-leading order in an expansion in l/(f 2 L 2 ). This is to be contrasted with 
the chiral expansion in the p-regime, where they appear already at next-to-leading order. 
This might be helpful for a precise determination of the leading-order LECs (/ and B ), 
because only these two parameters appear in fits using next-to-leading order expressions in 
the e-regime. This is another example in which we can extract physical quantities (/ and 
Bq) from unphysical computations — clearly, a small, euclidean, four-dimensional, finite 
volume with periodic boundary conditions is unphysical, but accessible to lattice QCD. 
This concludes my brief overview of the application of ChPT to the study of volu me 



dependence. For applications to the distribution of topological charge, see Ref. [116| . I 



have already mentioned the extension of the chiral expansion to the partially quenched case 
with some quark masses are in the e-regime, with others in the p-regime (with the obvious 
applicati on t o QCD with 2 + 1 flavors with only the strange quark mass in the p- regime), 
see Ref. 
see Ref. 



T07| l. 

Hi; 



107 . For a recent extension including lattice-spacing artifacts for Wilson fermions, 



1 171 ] : for a recent higher-order study in a partially-quenched setting, see Ref. [118 



These references contain fairly complete pointers to earlier work in the e-regime. 



The sum over p is of course infinite, but we are interested here only in the 1/L 2 part, which is finite. 
The volume-independent infinite part can be absorbed into a multiplicative renormalization of m. In 
dimensional regularization, the "infinite" part actually vanishes. 
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VII. CONCLUDING REMARKS 



Clearly, ChPT is a very useful, and, in practice, indispensable tool for extracting hadronic 
physical quantities from lattice QCD, which, in almost all cases can only obtain correlation 
functions in some unphysical regime. We have seen that this includes nonzero lattice spacing 
and finite volume, necessarily, but also "much more" unphysical situations, such as inde- 
pendent choices for valence and sea quarks on the lattice. The chiral lagrangian, as well 
as other EFTs, such as baryon ChPT, heavy-light ChPT and two-nucleon EFT, are local 
theories, defined in terms of a number of LECs. Once lattice QCD determines the values of 
these LECs, even if this is by matching unphysical correlation functions, they can be used 
to evaluate any hadronic quantity accessible to the various EFTs. While we have restricted 
ourselves in these lectures to the physics of Nambu-Goldstone bosons, the conceptual points 
all carry over to EFTs that include the interactions of other hadrons with Nambu-Goldstone 
bosons. 

There are many applications of ChPT, relevant for lattice QCD, that were left out of these 
lectures. As already mentioned during these lectures, one can extend ChPT to include the 
interactions of pions with heavier hadrons. While we briefly discussed this in the conte xt of 



"heavy" kaons in Sec. [V] the methods describe d there can be extended to baryons [99|, llOl 



and hadrons containing heavy quarks [1191. 11201 ] . For a rece nt review of hadron interactions, 



including tt-tt and baryon-baryon scattering, see Ref. [121 



Some exercises 

Most explicit equations in these lecture notes can be derived with relative ease. None 
of them require more than a one-loop calculation, and in all cases except Eq. (15.161) the 
corresponding diagrams are tadpole diagrams. Here are a few more exercises: 

1. Derive Eq. (|2.15p . which is, in fact, valid for an arbitrary number of flavors. Simplify 
for Nf = 2. 

2. Show that tr(L At L I/ L M L i/ ) can be written as a linear combination of the terms in 
Eq. f)2.17p . using the Cayley-Hamilton theorem, which, for a three- by-three matrix 
A says that 

A 3 - tr(A) A 2 + - (tr(A) 2 - tr(A 2 )) A - det(A) 1 = 0. (7.1) 

[Hint: one way to proceed is to consider tr((L M + L u ) 4 ^.} Simplify Eq. (I2.17P for 
N f = 2. 

3. Show that the scattering lengths for pion scattering in the O(N) linear sigma model 
are discontinuous across the phase transition between the symmetric phase and the 
phase in which O(N) breaks down to 0(N — 1) (cf. discussion after Eq. (I3.27P ). 

4. Consider quenched ChPT (i.e. , no sea quarks at all), with one valence quark and ignore 
the anomaly. Show that using the naive (but wrong!) symmetry group SU(1\1)l <8> 
SU (1\1) r for constructing the chiral lagrangian leads to nonsense for the vacuum. See 
Ref. |61] for more discussion. 
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5. Verify cyclicity of the supertrace on 2 x 2 graded matrices. 

6. Derive Eq. (14.35)) . Show that in the LCE regime all other terms in the chiral lagrangian 
contribute at next-to-leading order. 

7. Investigate possible higher order terms, to order p 4 , in the LCE regime for Wilson 
ChPT (see Ref. Q). 

8. Show that a solution of the equation ur = hu L h exists, for given ur and u^. Here 
Ur } l and h are unitary matrices with determinant one. 

9. Show that Eq. (15. lip is equivalent to Eq. (15.10)) plus a term that can be absorbed into 
the A x term of Eq. lOi| . 

10. Calculate the integral in Eq. (15.16)) and convince yourself of the correctness of the 
discussion following Eq. (J57TTJ) . 

11. Extend the discussion of Sec. IIII CI to Nf = 3 degenerate light quarks (in the LCE 
regime). I do not know the answer — as far as I know, you can publish the result! 
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